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1 Introduction 

These notes are based on a series of lectures given first at the University of Warwick in spring 2008 
and then at the Courant Institute in spring 2009. It is an attempt to give a reasonably self-contained 
presentation of the basic theory of stochastic partial differential equations, taking for granted basic 
measure theory, functional analysis and probability theory, but nothing else. Since the aim was 
to present most of the material covered in these notes during a 30-hours series of postgraduate 
lecture, such an attempt is doomed to failure unless drastic choices are made. This is why many 
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important facets of the theory of stochastic PDEs are missing from these notes. In particular, we 
do not treat equations with multiplicative noise, we do not treat equations driven Levy noise, we 
do not consider equations with 'rough' (that is not locally Lipschitz, even in a suitable space) 
nonlinearities, we do not treat measure-valued processes, we do not consider hyperbolic or elliptic 
problems, we do not cover Malliavin calculus and densities of solutions, etc. The reader who is 
interested in a more detailed exposition of these more technically subtle parts of the theory might 
be advised to read the excellent works [DPZ92b, DPZ96, PZ07, PR07, SS05]. 

Instead, the approach taken in these notes is to focus on semilinear parabolic problems driven 
by additive noise. These can be treated as stochastic evolution equations in some infinite-dimen- 
sional Banach or Hilbert space that usually have nice regularising properties and they already 
form (in my humble opinion) a very rich class of problems with many interesting properties. 
Furthermore, this class of problems has the advantage of allowing to completely pass under silence 
many subtle problems arising from stochastic integration in infinite-dimensional spaces. 

1.1 Acknowledgements 

These notes would never have been completed, were it not for the enthusiasm of the attendants of 
the course. Hundreds of typos and mistakes were spotted and corrected. I am particularly indebted 
to David Epstein and Jochen VoB who carefully worked their way through these notes when they 
were still in a state of wilderness. Special thanks are also due to Pavel Bubak who was running 
the tutorials for the course given in Warwick. 

2 Some Motivating Examples 

2.1 A model for a random string (polymer) 

Take N + 1 particles with positions u n immersed in a fluid and assume that nearest-neighbours 
are connected by harmonic springs. If the particles are furthermore subject to an external forcing 
F, the equations of motion (in the overdamped regime where the forces acting on the particle are 
more important than inertia, which can also formally be seen as the limit where the masses of the 
particles go to zero) would be given by 

k(ui - u Q ) + F(u ) , 

k(u n+ i + u n -i - 2u n ) + F(u n ) , n = 1, . . . , N - 1 , 
k(u N -i - u N ) + F(u N ) . 

This is a primitive model for a polymer chain consisting of N + 1 monomers and without self- 
interaction. It does however not take into account the effect of the molecules of water that would 
randomly 'kick' the particles that make up our string. Assuming that these kicks occur randomly 
and independently at high rate, this effect can be modelled in first instance by independent white 
noises acting on all degrees of freedom of our model. We thus obtain a system of coupled stochas- 
tic differential equations: 

duo = k{u\ — no) dt + F(uq) dt + a dwo(t) , 

du n = k(u n+ i + -u n _i - 2u n ) dt + F(u n ) dt + a dw n {t) , n = 1,. . . ,N — 1 , 
duM = k(uN-i — un) dt + F(un) dt + a dwN(t) . 



duo 

~~dt 

du n 

dt 
dux 
~dT 
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Formally taking the continuum limit (with the scalings k f» uN 2 and a rj n/JV), we can infer that 
if N is very large, this system is well-described by the solution to a stochastic partial differential 
equation 

du(x, t) = ud 2 u(x, t) dt + F(u(x, t)) dt + dW(x, t) , 

endowed with the boundary conditions d x u(0, t) = d x u(l,t) = 0. It is not so clear a priori what 
the meaning of the term dW(x, t) should be. We will see in the next section that, at least on a 
formal level, it is reasonable to assume that E dW ^' l) dW ^ ,s) = 6(x - y)S(t - s). The precise 
meaning of this formula will be discussed later. 

2.2 The stochastic Navier-Stokes equations 

The Navier-Stokes equations describing the evolution of the velocity field u(x, t) of an incom- 
pressible viscous fluid are given by 

^ = z/Au - (u ■ V)u - Vp + f , (2.1) 

complemented with the (algebraic) incompressibility condition div u = 0. Here, / denotes some 
external force acting on the fluid, whereas the pressure p is given implicitly by the requirement 
that div u = at all times. 

While it is not too difficult in general to show that solutions to (2.1) exist in some weak sense, 
in the case where x G R d with d > 3, their uniqueness is an open problem with a $1,000,000 
prize. We will of course not attempt to solve this long-standing problem, so we are going to 
restrict ourselves to the case d = 2. (The case d = 1 makes no sense since there the condition 
div u = would imply that u is constant. However, one could also consider the Burger's equation 
which has similar features to the Navier-Stokes equations.) 

For simplicity, we consider solutions that are periodic in space, so that we view u as a function 
from T 2 x R + to R 2 . In the absence of external forcing /, one can use the incompressibility 
assumption to see that 

4-/ \u(x,t)\ 2 dx = -2v f tr Du(x,t)*Du(x,t)dx < -2v f \u(x, t)\ 2 dx , 
dt Jt 2 Jt 2 Jt 2 

where we used the Poincare inequality in the last line (assuming that J r2 u(x, t) dx = 0). There- 
fore, by Gronwall's inequality, the solutions decay to exponentially fast. This shows that energy 
needs to be pumped into the system continuously if one wishes to maintain an interesting regime. 

One way to achieve this from a mathematical point of view is to add a force / that is randomly 
fluctuating. We are going to show that if one takes a random force that is Gaussian and such that 

Ef(x,t)f(y,s) = S(t-s)C(x-y), 

for some correlation function C then, provided that C is sufficiently regular, one can show that 
(2.1) has solutions for all times. Furthermore, these solutions do not blow up in the sense that one 
can find a constant K such that, for any solution to (2. 1), one has 

limsupE||u(£)|| 2 < K , 

for some suitable norm || • ||. This allows to provide a construction of a model for homogeneous 
turbulence which is amenable to mathematical analysis. 
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2.3 The stochastic heat equation 

In this section, we focus on the particular example of the stochastic heat equation. We will perform 
a number of calculations that give us a feeling for what the solutions to this equation look like. 
These calculations will not be completely rigorous but could be made so with some extra effort. 
Most tools required to make them rigorous will be introduced later in the course. 

2.3.1 Setup 

Recall that the heat equation is the partial differential equation: 

d t u = Au, u: R+ xR"-»R. (2.2) 

Given any bounded continuous initial condition uq : R ra — > R, there exists a unique solution u to 
(2.2) which is continuous on R + x R ra and such that u(0, x) = uq{x) for every x 6 R n . 
This solution is given by the formula 

1 f \*-y\ 2 

x) = (a 7W2 / e 4 * no( ^ dy ■ 
(47n) n ' z jR n 

We will denote this by the shorthand u(t, ■ ) = e At u by analogy with the solution to an Revalued 
linear equation of the type dtu = Au. 

Let us now go one level up in difficulty by considering (2.2) with an additional 'forcing term' 

/: 

d t u = Au + / , u:R + xR"^R. (2.3) 
From the variations of constants formula, we obtain that the solution to (2.3) is given by 

u(t, • ) = e At u + f e A(t - s) f{s, -)ds. (2.4) 
Jo 

Since the kernel defining e At is very smooth, this expression actually makes sense for a large 
class of distributions /. Suppose now that / is 'space-time white noise'. We do not define this 
rigorously for the moment, but characterise it as a (distribution-valued) centred Gaussian process 
£ such that E£(s, x)£(t, y) = S(t - s)6(x - y). 

The stochastic heat equation is then the stochastic partial differential equation 

d t u = Au + £ , u: R + xR"^R. (2.5) 

Consider the simplest case uq = 0, so that its solution is given by 

/■* 1 f l*-y\ 2 

u(t, x) = / : : — ^ / e ^-s) £(s, y) dy ds (2.6) 

Jo (4vr|t - s|)™/ 2 7r« ^ >w v 

This is again a centred Gaussian process, but its covariance function is more complicated. The aim 
of this section is to get some idea about the space-time regularity properties of (2.6). While the 
solutions to ordinary stochastic differential equations are in general a-Holder continuous (in time) 
for every a < 1/2 but not for a = 1/2, we will see that in dimension n = 1, u as given by (2.6) 
is only 'almost' 1/4-Holder continuous in time and 'almost' 1/2-Holder continuous in space. In 
higher dimensions, it is not even function- valued... The reason for this lower time-regularity is 
that the driving space-time white noise is not only very singular as a function of time, but also as 
a function of space. Therefore, some of the regularising effect of the heat equation is required to 
turn it into a continuous function in space. 
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Heuristically, the appearance of the Holder exponents 1/2 for space and 1/4 for time in di- 
mension n = 1 can be understood by the following argument. If we were to remove the term 
dtu in (2.5), then u would have the same time-regularity as £, but two more derivatives of space 
regularity. If on the other hand we were to remove the term Au, then u would have the sample 
space regularity as £, but one more derivative of time regularity. The consequence of keeping both 
terms is that we can 'trade' space-regularity against time-regularity at a cost of one time derivative 
for two space derivatives. Now we know that white noise (that is the centred Gaussian process 
r\ with Er](t)ri(s) = S(t — s)) is the time derivative of Brownian motion, which itself is 'almost' 
1/2-Holder continuous. Therefore, the regularity of i] requires 'a bit more than half a derivative' 
of improvement if we wish to obtain a continuous function. 

Turning back to £, we see that it is expected to behave like r] both in the space direction and 
in the time direction. So, in order to turn it into a continuous function of time, roughly half of a 
time derivative is required. This leaves over half of a time derivative, which we trade against one 
spatial derivative, thus concluding that for fixed time, u will be almost 1/2-Holder continuous in 
space. Concerning the time regularity, we note that half of a space derivative is required to turn 
£ into a continuous function of space, thus leaving one and a half space derivative. These can be 
traded against 3/4 of a time derivative, thus explaining the 1/4-Holder continuity in time. 

In Section 5.1, we are going to see more precisely how the space-regularity and the time- 
regularity interplay in the solutions to linear SPDEs, thus allowing us to justify rigorously this 
type of heuristic arguments. For the moment, let us justify it by a calculation in the particular case 
of the stochastic heat equation. 



2.3.2 A formal calculation 

Define the covariance for the solution to the stochastic heat equation by 

C(s,t,x,y) = Eu(s,x)u(t,y) , (2.7) 

where u is given by (2.6). 

By (statistical) translation invariance, it is clear that C(s, t, x, y) = C(s, t, 0, x — y). Using 
(2.6) and the expression for the covariance of £, one has 



C(s,t,0,x) 



rA^ E / / / / 1 i^H e 4< '~ r) ^^iir.y^ir'^y^dydy'dr'dr 

(47r) n Jo Jo Jr"Jr" \s — r'\ n ' z \t — r\ n ' z 



1 f sM f 1 l*-y| 2 \y\ 2 

= — / / — 777- r^-e 4 <*-' - > 4 < 3 - r > dy dr 

(4vr) n Jo Jr" \s - r\ n / 2 \t - r\ n / 2 



I f sAt f I 



(47r) n Jo 7r™ \s - r\ n / 2 \t - r\ n / 2 

X eXP V~4(t-r) ~ 2(t - r) ~ 4(s - r) ~ 4(t - r)) dydr ' 
The integral over y can be performed explicitly by 'completing the square' and one obtains 

C(s,t, 0, x) = 2' n (s + t- 2ry n/2 exp (- ' 1 J dr 
Jo v 4(s + t-2r)/ 

.. r"/ 2 exp(-LL)^. (2.8) 
l\ s -t\ v 4£ / 

We notice that the singularity at £ = is integrable if and only if n < 2, so that C(t, t, 0, 0) is 
finite only in the one-dimensional case. We therefore limit ourselves to this case in the sequel. 



= 2 -(n+l) j S r n/2 

J\s-t\ v 4 
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Remark 2.1 Even though the random variable u defined by (2.6) is not function-valued in dimen- 
sion 2, it is 'almost' the case since the singularity in (2.8) diverges only logarithmically. The sta- 
tionary solution to (2.5) is called the Gaussian free field and has been the object of intense studies 
over the last few years, especially in dimension 2. Its interest stems from the fact that many of its 
features are conformally invariant (as a consequence of the conformal invariance of the Laplacian), 
thus linking probability theory to quantum field theory on one hand and to complex geometry on 
the other hand. The Gaussian free field also relates directly to the Schramm-Loewner evolutions 
(SLEs) for the study of which Werner was awarded the Fields medal in 2006, see [Law04, SS06]. 
For more information on the Gaussian free field, see for example the review article by Sheffield 
[She07]. 

The regularity of u is determined by the behaviour of C near the 'diagonal' s = t, x = y. We 
first consider the time regularity. We therefore set x = and compute 

C(s,t, 0,0) = - f S+t r 1/2 di = \(\a + t\* - \s-t\h) . 
4 J\ s -t\ 

This shows that, in the case n = 1 and for s « t, one has the asymptotic behaviour 

E|u(s,0) - u(t,0)\ 2 w \t - s|3 . 

Comparing this with the standard Brownian motion for which E\B(s) — B(t)\ 2 = \t — s\, we 
conclude that the process t u(t, x) is, for fixed x, almost surely a-Holder continuous for any 
exponent a < 1/4 but not for a = 1/4. This argument is a special case of Kolmogorov's cele- 
brated continuity test, of which we will see a version adapted to Gaussian measures in Section 3.1. 

If, on the other hand, we fix s = t, we obtain (still in the case n = 1) via the change of 
variables z = \x\ 2 /At, the expression 

\x\ f°° 3 

C(M,0,x) = y j^z-ie-zdz. 

Integrating by parts, we get 

\/t W 2 \x\ f°° 1 
C(t, t, 0, x) = 1 -e~~ + i-1 / z ~2e~ z dz , 

So that to leading order we have for small values of x: 

C(t, t, 0, x) w ^ + M r z'\e- z dz = Vt + + 0(|x| 2 /8Vt) . 

4 4 Jo 4 

This shows that, at any fixed instant t, the solution to (2.5) looks like a Brownian motion in space 
over lengthscales of order t 1 ! 2 . Note that over such a lengthscale the Brownian motion fluctuates 
by about i 1 / 4 , which is precisely the order of magnitude of E\u(t, x)\. 

2.4 What have we learned? 

1. At a 'hand- waving' level, we have forced our equation with a term that has a temporal 
evolution resembling white noise, so that one would naively expect its solutions to have 
a temporal regularity resembling Brownian motion. However, for any fixed location in 
space, the solution to the stochastic heat equation has a time-regularity which is only almost 
Holder- \ , as opposed to the almost Holder- \ time-regularity of Brownian motion. 
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2. Unlike the solutions to an ordinary parabolic PDE, the solutions to a stochastic PDE tend to 
be spatially 'rough' . It is therefore not obvious a priori how the formal expression that we 
obtained is to be related to the original equation (2.5), since even for positive times, the map 
x u(t, x) is certainly not twice differentiable. 

3. Even though the deterministic heat equation has the property that e At u — > as t — ► oo for 
every u € L 2 , the solution to the stochastic heat equation has the property that E\u(x, i)| 2 — > 
oo for fixed x as t — > oo. This shows that in this particular case, the stochastic forcing term 
pumps energy into the system faster than the deterministic evolution can dissipate it. 

Exercise 2.2 Perform the same calculation, but for the equation 

d t u = An - au + £ , u: R + xR^R. 

Show that as t — ► oo, the law of its solution converges to the law of an Ornstein-Uhlenbeck process 
(if the space variable is viewed as 'time'): 

lim Eu(t, x)u(t, y) = Ce~ c \ x ~ y \ . 
Compute the constants C and c as functions of the parameter a. 
3 Gaussian Measure Theory 

This section is devoted to the study of Gaussian measures on general Banach spaces. Throughout 
this section and throughout most of the remainder of these notes, we will denote by B an arbitrary 
separable Banach space. Recall that a space is separable if it contains a countable dense subset, 
see for example the monograph [Yos95]. This separability assumption turns out to be crucial for 
measures on B to behave in a non-pathological way. It can be circumvented by trickery in most 
natural situations where non-separable spaces arise, but we choose not to complicate our lives by 
considering overly general cases in these notes. Another convention that will be used throughout 
these notes is that all of the measures that we consider are Borel measures, meaning that we 
consider every open set to be measurable. 

One additional assumption that would appear to be natural in the context of Gaussian measure 
theory is that B be reflexive (that is B** = B). This is for example because the mean of a measure 
H appears at first sight to be an element of B** rather than of B, since the natural 1 way of defining 
the mean m of p, is to set mil) = J B £(x) /j,(dx) for any £ G B*. This turns out not to be a problem, 
since the mean of a Gaussian measure on a separable Banach space B is always an element of 
B itself, see the monograph [Bog98]. However this result is not straightforward to prove, so we 
will take here the more pragmatic approach that whenever we consider Gaussian measures with 
non-zero mean, we simply take the mean meBas given. 

Example 3.1 Before we proceed, let us just mention a few examples of Banach spaces. The 
spaces L V {M. , v) (with (M. , v) any countably generated measure space like for example any Polish 
space equipped with a Radon measure v) forp G (1, oo) are both reflexive and separable. However, 
reflexivity fails in general for L 1 spaces and both properties fail to hold in general for L°° spaces 
[Yos95]. The space of bounded continuous functions on a compact space is separable, but not 
reflexive. The space of bounded continuous functions from R n to R is neither separable nor 



'Without further assumption, we do not know a priori whether x ^ \\x\\ is integrable, so that the more natural 
definition m = J B x fj,(dx) is prohibited. 
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reflexive, but the space of continuous functions from R n to R vanishing at infinity is separable. 
(The last two statements are still true if we replace R n by any locally compact complete separable 
metric space.) Hilbert spaces are obviously reflexive since H* = H for every Hilbert space H by 
the Riesz representation theorem [Yos95]. There exist non-separable Hilbert spaces, but they have 
rather pathological properties and do not appear very often in practice. 

We start with the definition of a Gaussian measure on a Banach space. Since there is no 
equivalent to Lebesgue measure in infinite dimensions (one could never expect it to be er-additive), 
we cannot define it by prescribing the form of its density. However, it turns out that Gaussian 
measures on R™ can be characterised by prescribing that the projections of the measure onto any 
one-dimensional subspace of R™ are all Gaussian. This is a property that can readily be generalised 
to infinite-dimensional spaces: 

Definition 3.2 A Gaussian probability measure /j, on a Banach space B is a Borel measure such 
that £*p, is a real Gaussian probability measure on R for every linear functional I: B — > R. (Dirac 
measures are also considered to be Gaussian measures, but with zero covariance.) We call it 
centred if t p, is centred for every t. 

Remark 3.3 We used here the notation /*// for the push-forward of a measure p, under a map /. 
This is defined by the relation (f*p)(A) = fi(f~\A)). 

Remark 3.4 We could also have defined Gaussian measures by imposing that T*fi is Gaussian for 
every bounded linear map T: B —> R™ and every n. These two definitions are equivalent because 
probability measures on R n are characterised by their Fourier transforms and these are constructed 
from one-dimensional marginals, see Proposition 3.9 below. 

Exercise 3.5 Let {£ n } be a sequence of i.i.d. M(0, 1) random variables and let {a n } be a sequence 
of real numbers. Show that the law of (ao£o> ai^i , • • •) determines a Gaussian measure on I 2 if 
and only if J2n>o a n < °°- 

One first question that one may ask is whether this is indeed a reasonable definition. After all, it 
only makes a statement about the one-dimensional projections of the measure \i, which itself lives 
on a huge infinite-dimensional space. However, this turns out to be reasonable since, provided 
that B is separable, the one-dimensional projections of any probability measure carry sufficient 
information to characterise it. This statement can be formalised as follows: 

Proposition 3.6 Let B be a separable Banach space and let p and v be two probability Borel 
measures on B. If I* p = tv for every I G B*, then (i = v. 

Proof. Denote by Cyl(B) the algebra of cylindrical sets on B, that is A G Cyl(£>) if and only 
if there exists n > 0, a continuous linear map T:B — > R n , and a Borel set A C R ra such that 
A = T~ X A. It follows from the fact that measures on R n are determined by their one-dimensional 
projections that p,(A) = v{A) for every A G Cy\(B) and therefore, by a basic uniqueness result 
in measure theory (see Lemma II.4.6 in [RW94] or Theorem 1.5.6 in [Bog07] for example), for 
every A in the cr-algebra £(B) generated by Cy\(B). It thus remains to show that £(B) coincides 
with the Borel cr-algebra of B. Actually, since every cylindrical set is a Borel set, it suffices to 
show that all open (and therefore all Borel) sets are contained in £{B). 

Since B is separable, every open set U can be written as a countable union of closed balls. (Fix 
any dense countable subset {x n } of B and check that one has for example U = Urr n e(y B(x n , r n ), 
where r n = ^sup{r > : B(x n ,r) C £7} and B(x,r) denotes the closed ball of radius r 
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centred at x.) Since £{B) is invariant under translations and dilations, it remains to check that 
-6(0, 1) G £(B). Let {x n } be a countable dense subset of {i £ B : ||x|| = 1} and let £ n be 
any sequence in B* such that \\£ n \\ = 1 and £ n (x n ) = 1 (such elements exist by the Hahn-Banach 
extension theorem [Yos95]). Define now K = Pln>o{ x e & '■ I^n0*0| < !}• It i s c l ear that 
K G £ (B), so that the proof is complete if we can show that K = B(0, 1). 

Since obviously B(0, 1) C K, it suffices to show that the reverse inclusion holds. Let y G 
B with ||y|| > 1 be arbitrary and set y = y/\\y\\. By the density of the x n 's, there exists a 
subsequence Xk n such that \\xk n — y\\ < \, say, so that lk n (y) > 1 - ^- By linearity, this implies 
that £k n (y) > || y — ^)» so that there exists a sufficiently large n so that £k n (y) > 1- This shows 
that y g" K and we conclude that K C .6(0, 1) as required. □ 

From now on, we will mostly consider centred Gaussian measures, since one can always re- 
duce oneself to the general case by a simple translation. Given a centred Gaussian measure p, we 
define a map C M : B* x B* — > R by 

C J£, £') = { e(x)£'(x) ii{dx) . (3.1) 

JB 

Remark 3.7 In the case B = R n , this is just the co variance matrix, provided that we perform the 
usual identification of R" with its dual. 

Remark 3.8 One can identify in a canonical way C M with an operator C^.B* — > B** via the 
identity C „{£)(£') = C „{£,£'). 

The map C M will be called the Covariance operator of fi. It follows immediately from the 
definitions that the operator C M is bilinear and positive definite, although there might in general 
exist some £ such that C^, £) = 0. Furthermore, the Fourier transform of /i is given by 

m = J z U(x) Kdx) = exp(-iC^, £)) , (3.2) 

where £ G B*. This can be checked by using the explicit form of the one-dimensional Gaussian 
measure. Conversely, (3.2) characterises Gaussian measures in the sense that if n is a measure 
such that there exists satisfying (3.2) for every £ G B* , then ^ must be centred Gaussian. The 
reason why this is so is that two distinct probability measures necessarily have distinct Fourier 
transforms: 

Proposition 3.9 Let fi and v be any two probability measures on a separable Banach space B. If 
fi(£) = v(£)for every £ G B*, then fi = v. 

Proof. In the particular case B = R n , if 92 is a smooth function with compact support, it follows 
from Fubini's theorem and the invertibility of the Fourier transform that one has the identity 

f <p(x)n(dx) = — \- [ [ (p(k)e~ ikx dk n(dx) = — ^— f <p(k) fl(-k) dk , 

jR n (27T) n JR" JR" (2TT) n JR" 

so that, since bounded continuous functions can be approximated by smooth functions, p, is indeed 
determined by ft. The general case then follows immediately from Proposition 3.6. □ 

As a simple consequence, we have the following trivial but useful property: 
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Proposition 3.10 Let n be a Gaussian measure on B and, for every ip G R, define the 'rotation' 
R v :B 2 ^B 2 by 

R<p(x, y) = (x sin ip + y cos ip, x cos p — y sin ip) . 
Then, one has -R* (/U <8> //) = p, ® fx. 

Proof. Since we just showed in Proposition 3.9 that a measure is characterised by its Fourier 
transform, it suffices to check that p,® po = p® p, which is an easy exercise. □ 

3.1 A-priori bounds on Gaussian measures 

We are going to show now that the operator C u has to be bounded, as a straightforward conse- 
quence of the fact that x i-> ||a?|| 2 is integrable for any Gaussian measure. Actually, we are going 
to show much more, namely that there always exists a constant a > such that exp(a||x|| 2 ) is 
integrable! In other words, the norm of any Banach-space valued Gaussian random variable has 
Gaussian tails, just like in the finite-dimensional case. Amazingly, this result uses the Gaussianity 
of the measure only indirectly through the rotation invariance shown in Proposition 3.10, and even 
this property is only used for rotations by the angle p = ir/4. This is the content of the following 
fundamental result in the theory of Gaussian measures: 

Theorem 3.11 (Fernique, 1970) Let p. be any probability measure on a separable Banach space 
B such that the conclusion of Proposition 3.10 holds for p = n/4. Then, there exists a > such 
that f B exp(a||x|| 2 ) p(dx) < oo. 

Proof. Note first that, from Proposition 3.10, one has for any two positive numbers t and r the 
bound 



fJ-(\\x\\ < T)fi(\\x\\ > t) = / / n(dx) p,(dy) = / fi(dx) fi(dy) 

J\\x\\<t J\\y\\>t J\\^\\< T J\\£+M\\ >t 

< f J AdxMdy) = m(H >^) 2 . (3-3) 

J \\ x \\>— J \\y\\>-j5 

In order to go from the first to the second line, we have used the fact that the triangle inequality 
implies 

min{||x||,||y||} > l(\\x + y\\ - , 

so that \\x + y\\ > \[2t and \x — y|| < \/2r do indeed imply that both ||ar|| and ||y|| are greater 
than Since ||x|| is ^i-almost surely finite, there exists some r > such that fi(\\x\\ < r) > |. 

Set now to = t and define t n for n > recursively by the relation t n = *"^~ r . It follows from 
(3.3) that 

MINI > Wi) < fi(\\x\\ > *=±p) 2 MI|z|| < r) < fMlkll > t n ) 2 ■ 

Setting y n = |/i(||x|| > t n+ \), this yields the recursion y n +i < y\ with y$ < 1/3. Applying this 
inequality repeatedly, we obtain 

.iiii > 3 3 nn 1 -i on . on 

Ml|x|| > t n ) = -y n < -yl <-^ 1 ' 2 <3~ 2 . 

On the other hand, one can check explicitly that t n = ^^rf^ 7 " - ' ^ 2 + v / 2)t, so that in 
particular t n +i < 2 n / 2 • 5r. This shows that one has the bound 

— 

l4.\\x\\ >t n )<3 aSTT , 
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implying that there exists a universal constant a > such that the bound /i(||x|| > t) < 
exp(— 2at 2 /r 2 ) holds for every t > t. Integrating by parts, we finally obtain 

r /(tIItII 2 n in r°° t 2 

/ exp(^4^J n{dx) <e a + -^ / te a ^^{\\x\\ > t)dt 

/oo „ 
te- at dt < oo , (3.4) 

which is the desired result. □ 



As an immediate corollary of Fernique's theorem, we have 



Corollary 3.12 There exists a constant ||C M || < oo such that 0^(1, £') < \\C^\\ \\£\\ \\£'\\ for any 
£, £' £ B*. Furthermore, the operator defined in Remark 3.8 is a continuous operator from B* 
toB. 



Proof. The boundedness of C M implies that is continuous from B* to £>**. However, £>** might 
be strictly larger than B in general. The fact that the range of C M actually belongs to B follows 
from the fact that one has the identity 

CJ = [ x £(x) fj,(dx) . (3.5) 

JB 

Here, the right-hand side is well-defined as a Bochner integral [Boc33, Hil53] because B is as- 
sumed to be separable and we know from Fernique's theorem that ||x|| 2 is integrable with respect 

tO jJL. □ 



Remark 3.13 In Theorem 3.11, one can actually take for a any value smaller than 1/(2||C M ||). 
Furthermore, this value happens to be sharp, see [Led96, Thm 4.1]. 

Another consequence of the proof of Fernique's theorem is an even stronger result, namely 
all moments (including exponential moments!) of the norm of a Banach-space valued Gaussian 
random variable can be estimated in a universal way in terms of its first moment. More precisely, 
we have 



Proposition 3.14 There exist universal constants a,K > with the following properties. Let p, 
be a Gaussian measure on a separable Banach space B and let f: R + — > R + be any measurable 
function such that f(x) < Cf exp(ax 2 ) for every x > 0. Define furthermore the first moment of n 
by M = j B \\x\\ fj,(dx). Then, one has the bound j B /(||x||/M) p(dx) < KCf. 

In particular, the higher moments of p are bounded by f B \\x\\ 2n fi(dx) < n\Ka~ n M 2n . 



Proof. It suffices to note that the bound (3.4) is independent of r and that by Chebychev's in- 
equality, one can choose for example r = AM. The last claim then follows from the fact that 

„«i 2 > a n x 2n „ 
e - n! • U 

Actually, the covariance operator is more than just bounded. If B happens to be a Hilbert 
space, one has indeed the following result, which allows us to characterise in a very precise way 
the set of all centred Gaussian measures on a Hilbert space: 
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Proposition 3.15 If B = TC is a Hilbert space, then the operator C^-.TC — > TC defined as before 
by the identity {C^h, k) = C^(h, k) is trace class and one has the identity 

I \\x\\ 2 p,(dx) = trCn . (3.6) 
Jn 

(Here, we used Riesz's representation theorem to identify TC with its dual.) 

Conversely, for every positive trace class symmetric operator K:TC — > TC, there exists a Gaus- 
sian measure fionTC such that = K. 

Proof. Fix an arbitrary orthonormal basis {e n } of TC. We know from Theorem 3.11 that the second 
moment of the norm is finite: f H \\h\\ 2 fi(dh) < oo. On the other hand, one has 

^ oo oo 

/ \\h\\ 2 n(dh) = J2 / (h,e n ) 2 f i{dh) = Y,(en,C^e n )=trC fl , 

JH n=l jH n=l 

which is precisely (3.6). To pull the sum out of the integral in the first equality, we used Lebegue's 
dominated convergence theorem. 

In order to prove the converse statement, since K is compact, we can find an orthonormal 
basis {e n } of TC such that Ke n = \ n e n and \ n > 0, J2n < oo. Let furthermore {£ n } be 
a collection of i.i.d. Af(0, 1) Gaussian random variables (such a family exists by Kolmogorov's 
extension theorem). Then, since £) n A n E^ = tr K < oo, the series X^n V^n^n^n converges in 
mean square, so that it has a subsequence converging almost surely in TC. One can easily check 
that the law of the limiting random variable is Gaussian and has the requested covariance. □ 

No such precise characterisation of the covariance operators of Gaussian measures exists in 
Banach spaces. One can however show that C M is at least a little bit better than bounded, namely 
that it is always a compact operator. We leave this statement as an exercise for the interested 
reader, since we will not make any use of it in these notes: 

Exercise 3.16 Show that in the case of a Gaussian measure /iona general separable Banach space 
B, the covariance operator C^:B* — ► B is compact in the sense that it maps the unit ball on B* 
into a compact subset of B. Hint: Proceed by contradiction by first showing that if C M wasn't 
compact, then it would be possible to find a constant c > and a sequence of elements {lk}k>o 
such that \\£k\\ = 1, C^iik^j) = for k / j, and C^(£k,ik) > c for every k. Conclude that if 
this was the case, then the law of large numbers applied to the sequence of random variables i n (x) 
would imply that sup fc>0 £k(x) = oo /i-almost surely, thus obtaining a contradiction with the fact 
that sup k>0 £k(x) < \\x\\ < oo almost surely. 

In many situations, it is furthermore helpful to find out whether a given covariance structure 
can be realised as a Gaussian measure on some space of Holder continuous functions. This can be 
achieved through the following version of Kolmogorov's continuity criterion, which can be found 
for example in [RY94, p. 26]: 

Theorem 3.17 (Kolmogorov) For d > 0, let C: [0, l] d x [0, l] d — > R be a symmetric function 
such that, for every finite collection {xi} 7 j ^ =1 of points in [0, l] d , the matrix Cij = C(xi,Xj) is 
positive definite. If furthermore there exists a > and a constant K > such that C(x,x) + 
C(y,y) — 2C(x,y) < K\x — y\ 2a for any two points x,y G [0, l] d then there exists a unique 
centred Gaussian measure \x on C([0,l] d ,R) such that 



I f(x)f(y)»(df) = C(x,y), 

C([0,l] d ,R) 



(3.7) 
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for any two points x, y G [0, l] d . Furthermore, for every (3 < a, one has /i(C /J ([0,l] d ,R)) = I, 
where C^([0, l] d , R) is space of /3-Holder continuous functions. 

Proof. Set B = C([0, l] d ,R) and 0* its dual, which consists of the set of Borel measures with 
finite total variation [Yos95, p. 119]. Since convex combinations of Dirac measures are dense (in 
the topology of weak convergence) in the set of probability measures, it follows that the set of 
linear combinations of point evaluations is weakly dense in B*. Therefore, the claim follows if 
we are able to construct a measure p on B such that (3.7) holds and such that, if / is distributed 
according to p, then for any finite collection of points {x{\ C [0, l] d , the joint law of the f(xi) is 
Gaussian. 

By Kolmogorov's extension theorem, we can construct a measure po on X = R [0 > 1] endowed 
with the product cr-algebra such that the laws of all finite-dimensional marginals are Gaussian and 
satisfy (3.7). We denote by X an A'-valued random variable with law pq. At this stage, one would 
think that the proof is complete if we can show that X almost surely has finite /3-Holder norm. 
The problem with this statement is that the /3-Holder norm is not a measurable function on XI 
The reason for this is that it requires point evaluations of X at uncountably many locations, while 
functions that are measurable with respect to the product cr-algebra on X are allowed to depend 
on at most countably many function evaluations. 

This problem can be circumvented very elegantly in the following way. Denote by V C [0, l] d 
the subset of dyadic numbers (actually any countable dense subset would do for now, but the 
dyadic numbers will be convenient later on) and define the event VLp by 

Q,p = {X : X(x) = lim X(y) exists for every x G [0, l] d and X belongs to C /3 ([0, l] d ,R)} . 

yeT> 

Since the event fig can be constructed from evaluating X at only countably many points, it is a 
measurable set. For the same reason, the map given by 

| otherwise 

is measurable with respect to the product cr-algebra on X (and the Borel cr-algebra on C 13 ), so that 
the claim follows if we can show that yuo(fig) = 1 for every f5 < a. (Take p = L*po-) Denoting 
the /3-Holder norm of X restricted to the dyadic numbers by Mp(X) — sup^,^ . Xj y^x> 

{\X(x) - 

X(y)\/\x — yf}, we see that fig can alternatively be characterised as Qp = {X : Mp(X) < 00}, 
so that the claim follows if we can show for example that EM^(X) < 00. 

Denote by V m C V the set of those numbers whose coordinates are integer multiples of 2~ m 
and denote by A m the set of pairs x, y G V m such that \x — y\ = 2~ m . In particular, note that A m 
contains at most 2 (m+2)d such pairs. 

We are now going to make use of our simplifying assumption that we are dealing with Gaus- 
sian random variables, so that pth moments can be bounded in terms of second moments. More 
precisely, for every p > 1 there exists a constant C p such that if X is a Gaussian random variable, 
then one has the bound E|X|p < C p (E\X\ 2 ) p/2 . Setting K m (X) = sup x yeAm \X(x)-X(y)\ and 
fixing some arbitrary (3 1 G (/3, a), we see that for p > 1 large enough, there exists a constant K p 
such that 

EKP t (X)< ]T E\X(x) - X(y)\P < C p ]T (E\X(x) - X(y)\ 2 ) p/2 

x,y£A m x,y£A m 

= C P J2 ip{x, x) + C(y, y) - 2C(x, y)f' 2 < C p 2^ d ~ am P 

x,y£A m 
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< C p 2-P' mp , 

for some constants C p . (In order to obtain the last inequality, we had to assume that p > 

which can always be achieved by some value of p independent of m since we assumed that (3' < 

a.) Using Jensen's inequality, this shows that there exists a constant K such that the bound 

EK m (X) < K2- I3 ' m (3.8) 

holds uniformly in m. Fix now any two points x, y G V with x ^ y and denote by m the largest 
m such that \x — y\ < 2~ m . One can then find sequences {x n } n > mo and {y n }n>rn with the 
following properties: 



1. One has lim„_ 



x and lim n _ 



.y n = y- 



2. Either x 



y mo or both points belong to the vertices of the same 'dyadic hypercube' in 
V mo , so that they can be connected by at most d 'bonds' in A mo . 

3. For every n > uiq, x n and x n+ \ belong to the vertices of the same 'dyadic hypercube' 
in V n+ \, so that they can be connected by at most d 'bonds' in A n+ i and similarly for 

(y n ,yn+i)- 

One way of constructing this sequence is to order elements in V m by 
lexicographic order and to choose x n = max{x G V n : aij < Xj Vj}, as 
illustrated in the picture to the right. This shows that one has the bound 

oo 

\X(x) - X(y)\ < \X(x mo ) - Y(x mo )\ + Yl \X(x n+ i) - X(x n )\ 



n=mo 



+ \X(y n+ i) - X(y n )\ 

n=mo 

oo oo 

< dK mo (X) + 2dJ2 K n+l {X) < 2d K n(X) ■ 

n=mo n=mo 

Since tuq was chosen in such a way that \x — y\ > 2~ m °~ 1 , one has the bound 

oo oo 

M p {X) < 2d sup 2 /3(m+1) J2 K n (X) < 2? +1 d 2< 3n K n {X) . 

m >0 n=m n=0 

It follows from this and from the bound (3.8) that 

oo oo 

E|Mg(X)| < 2 l3+l d 2 (3n EK n (X) < 2< 3+1 dK ^ 2 (/3 " /3 ' ) " < oo 







t * 

X 






• 




» — i 






• * 


» i 


1 





n=0 



n=0 



since j3' was chosen strictly larger than (3. 



□ 



Combining Kolmogorov's continuity criterion with Fernique's theorem, we note that we can 
apply it not only to real-valued processes, but to any Gaussian Banach-space valued process: 

Proposition 3.18 Let B be a separable Banach space and let {X(x)} x ^ be a collection of 
B-valued Gaussian random variables such that 

E\\X(x) - X(y)\\ < C\x - y\ a , 

for some C > and some a G (0, 1]. Then, there exists a unique Gaussian measure p, on 
C([0, l] d ,B) such that, ifY is a random variable with law [i, then Y(x) is equal in law to X(x) 
for every x. Furthermore, /i(C /3 ([0, l] d ,B)) = 1 for every (3 < a. 
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Proof. The proof is identical to that of Theorem 3.17, noting that the bound E||X(x) — < 
C p \x — y\ ap follows from the assumption and Proposition 3.14. □ 

Remark 3.19 The space C^([0, l] d ,R) is not separable. However, the space C^([0,l] d ,R) of 
Holder continuous functions that furthermore satisfy lim^a; ^4^ff = uniformly in x is 
separable (polynomials with rational coefficients are dense in it). This is in complete analogy 
with the fact that the space of bounded measurable functions is not separable, while the space of 
continuous functions is. 

It is furthermore possible to check that C 3 ' C Cq for every (3' > (3, so that Exercise 3.39 below 
shows that p, can actually be realised as a Gaussian measure on Cq ([0, l] d , R). 

Exercise 3.20 Try to find conditions on G C R rf that are as weak as possible and such that 
Kolmogorov's continuity theorem still holds if the cube [0, l] d is replaced by G. Hint: One 
possible strategy is to embed G into a cube and then to try to extend C(x, y) to that cube. 

Exercise 3.21 Show that if G is as in the previous exercise, TL is a Hilbert space, and C:GxG—> 
£(TL, TL) is such that C(x, y) positive definite, symmetric, and trace class for any two x, y G 
G, then Kolmogorov's continuity theorem still holds if its condition is replaced by trC(x,x) + 
tr C(y, y) — 2 tr C(x, y) < K\x — y\ a . More precisely, one can construct a measure /j, on the space 
C^([0,l] d ,TL) such that 

(h, f(x))(f(y), k) M4f) = (h, C(x,y)k) , 

C0([O,l] d ,R) 

for any x,y G G and h,k G TL. 

A very useful consequence of Kolmogorov's continuity criterion is the following result: 

Corollary 3.22 Let {r/fc}fc>o be countably many i.i.d. standard Gaussian random variables (real 
or complex). Moreover let {fk}k>o C Lip(G, C) where the domain G C R d is sufficiently regular 
for Kolomgorov's continuity theorem to hold. Suppose there is some 5 G (0, 2) such that 

S 2 i = E UMli- < 00 and S 2 = E IIMli»Lip(/fc)* < 00 . (3-9) 
fee/ fee/ 

and define f = J2kel Vkfk- Then f is almost surely bounded and Holder continuous for every 
Holder exponent smaller than 5/2. 

Proof. From the assumptions we immediately derive that f(x) and f(x) — f(y) are a centred 
Gaussian for any x,y G G. Moreover, the corresponding series converge absolutely. Using that 
the rjk are i.i.d., we obtain 

E|/(x) - f(y)\ 2 = E \fk(x) - fk(y)\ 2 < E min {2||/fe||loo,Lip(/ fc ) 2 |x - y\ 2 } 
fee/ fee/ 

< 2E \\fk\\ 2 L -JUp(f k ) s \x - y\ 5 = 2S 2 \x - y\ 5 , 
fee/ 

where we used that min{a, bx 2 } < a 1_5 / 2 6 5//2 |x| 5 for any a, b > 0. The claim now follows from 
Kolmogorov's continuity theorem. □ 

Remark 3.23 One should really think of the f^s in Corollary 3.22 as being an orthonormal basis 
of the Cameron-Martin space of some Gaussian measure. (See Section 3.2 below for the definition 
of the Cameron-Martin space associate to a Gaussian measure.) The criterion (3.9) then provides 
an effective way of deciding whether the measure in question can be realised on a space of Holder 
continuous functions. 
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3.2 The Cameron-Martin space 

Given a Gaussian measure /j, on a separable Banach space B, it is possible to associate to it in 
a canonical way a Hilbert space C B, called the Cameron-Martin space of fi. The main 
importance of the Cameron-Martin space is that it characterises precisely those directions in B in 
which translations leave the measure jj, 'quasi-invariant' in the sense that the translated measure 
has the same null sets as the original measure. In general, the space TL^ will turn out to be strictly 
smaller than B. Actually, this is always the case as soon as dim = oo and, even worse, we 
will see that in this case one necessarily has n{H^) = 0! Contrast this to the case of finite- 
dimensional Lebesgue measure which is invariant under translations in any direction! This is a 
striking illustration of the fact that measures in infinite-dimensional spaces have a strong tendency 
of being mutually singular. 

The definition of the Cameron-Martin space is the following, where we postpone to Re- 
mark 3.26 and Proposition 3.30 the verification that \\h\\^ is well-defined and that \\h\\^ > 
for h / 0: 

Definition 3.24 The Cameron-Martin space of /i is the completion of the linear subspace 
C B defined by 

= {h g B : 3 h* g B* with C^(h*J) = £(h) W e £>*} , 

under the norm \\h\\^ = (h, h)^ = C M (/i* ,h*). It is a Hilbert space when endowed with the scalar 
product (h, k)^ = C M (/i*, k*). 

Exercise 3.25 Convince yourself that the space is nothing but the range of the operator C M 
defined in Remark 3.8. 

Remark 3.26 Even though the map h i— > h* may not be one to one, the norm \\h\\^ is well- 
defined. To see this, assume that for a given h G H^, there are two corresponding elements hi and 
h\ in B*. Then, defining k = h\ + h^, one has 

C^hl hi) - C^h*, h*) = C^(hl,k) - C^(h* 2 , k) = k(h) - k(h) = , 

showing that ||/i|| M does indeed not depend on the choice of h*. 

Exercise 3.27 The Wiener measure fi is defined on B = C([0, 1], R) as the centred Gaussian mea- 
sure with covariance operator given by C^iSg, St) = s A t. Show that the Cameron-Martin space 

1 2 

for the Wiener measure on B = C([0, 1],R) is given by the space H ' ([0, 1]) of all absolutely 
continuous functions h such that /i(0) = and Jq h 2 (t) dt < oo. 

Exercise 3.28 Show that in the case B = R™, the Cameron-Martin space is given by the range of 
the covariance matrix. Write an expression for Wh]]^ in this case. 

Exercise 3.29 Show that the Cameron-Martin space of a Gaussian measure determines it. More 
precisely, if /i and v are two Gaussian measures on B such that = H u and such that ||/i|| M = 
\\h\\ v for every h G H^, then they are identical. 

For this reason, a Gaussian measure on B is sometimes given by specifying the Hilbert space 
structure (H^, \\ ■ || M ). Such a specification is then usually called an abstract Wiener space. 

Let us discuss a few properties of the Cameron-Martin space. First of all, we show that it is a 
subspace of B despite the completion procedure and that all non-zero elements of have strictly 
positive norm: 
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Proposition 3.30 One has C B. Furthermore, one has the bound 

(h,h)p > WC^Whf , (3.10) 

where the norms on the right hand side are understood to be taken in B. 

Proof. One has the chain of inequalities 

,,,,,2 m 2 C^(h*,£) 2 C^(h*,h*)C^e,£) 

eeB*\{o} ¥\\ e&B*\{o} ¥\\ feB*\{o} ¥\\ 

which yields the bound on the norms. The fact that is a subset of B (or rather that it can be 
interpreted as such) then follows from the fact that B is complete and that Cauchy sequences in 
Hfj, are also Cauchy sequences in B by (3.10). □ 

A simple example showing that the correspondence h i-> h* in the definition of is not 
necessarily unique is the case p = do, so that = 0. If one chooses h = 0, then any h* G B 
has the required property that C^h* , i) = £(h), so that this is an extreme case of non-uniqueness. 
However, if we view B* as a subset of L 2 (B, p) (by identifying linear functional that agree p- 
almost surely), then the correspondence h i-> h* is always an isomorphism. One has indeed 
J B h*(x) 2 p(dx) = C^(h* ,h*) = In particular, if h\ and h 2 are two distinct elements of B* 

associated to the same element h G B, then h\ — h* 2 is associated to the element and therefore 
J B (hl - h* 2 ) 2 {x) ji{dx) = 0, showing that h\ = h* 2 as elements of L 2 (B, fx). We have: 

Proposition 3.31 There is a canonical isomorphism i: h ^ h* between TL^ and the closure IZ^ 
ofB* in L 2 (B, fi). In particular, TL^ is separable. 

Proof. We have already shown that t: — > L 2 (B, p) is an isomorphism onto its image, so it 
remains to show that all of B* belongs to the image of t. For h G B* , define h* G B as in (3.5) by 



h* 



/ x h(x) /j,(dx) 

JB 



This integral converges since ||x|| 2 is integrable by Fernique's theorem. Since one has the identity 
£(h*) = Cfj,(£, h), it follows that h* G and h = i(h*), as required to conclude the proof. 

The separability of then follows immediately from the fact that L 2 (B, p) is separable when- 
ever B is separable, since its Borel cr-algebra is countably generated. □ 

Remark 3.32 The space is called the reproducing kernel Hilbert space for p (or just repro- 
ducing kernel for short). However, since it is isomorphic to the Cameron-Martin space in a natural 
way, there is considerable confusion between the two in the literature. We retain in these notes the 
terminology from [Bog98], but we urge the reader to keep in mind that there are authors who use 
a slightly different terminology. 

Remark 3.33 In general, there do exist Gaussian measures with non-separable Cameron-Martin 
space, but they are measures on more general vector spaces. One example would be the measure 
on R R (yes, the space of all functions from R to R endowed with the product cr-algebra) given 
by the uncountable product of one-dimensional Gaussian measures. The Cameron-Martin space 
for this somewhat pathological measure is given by those functions / that are non-zero on at most 
countably points and such that J2teR l/(*)| 2 < °°- This is a prime example of a non-separable 
Hilbert space. 
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Exercise 3.34 Let fi be a Gaussian measure on a Hilbert space H with covariance K and consider 
the spectral decomposition of K: Ke n = X n e n with J2 n >i A n < oo and {e n } an orthonormal 
basis of eigenvectors. Such a decomposition exists since we already know that K must be trace 
class from Proposition 3.15. 

Assume now that A n > for every n. Show that is given by the range of K and that 
the correspondence h ^ h* is given by h* = K~ l h. Show furthermore that the Cameron- 
Martin space consists of those elements h of H such that J2 n >i Ki 1 ^, e n} 2 < oo and that 

Exercise 3.35 Show that one has the alternative characterisation 

||% = suv{£(h) : C^(£,£) < 1} , (3.11) 

and Ji^ = {h € B : ||/i||^ < oo}. Hint: Use the fact that in any Hilbert space H, one has 
\\h\\ = mp{(k,h) : \\k\\ < 1}. 

Since elements in IZ^ are built from the space of all bounded linear functionals on B, it should 
come as little surprise that its elements are 'almost' linear functionals on B in the following sense: 

Proposition 3.36 For every I 6 IZ^ there exists a measurable linear subspace Ve of B such that 
fJ-iVe) = 1 and a linear map I: Ve — > R such that 1 = 1 ii-almost surely. 

Proof. Fix £ 6 IZ^. By the definition of TZ^ and Borel-Cantelli, we can find a sequence £ n £ B* 
such that lim n ^ 00 £ n (x) = £{x) for /i-almost every x G B. (Take for example £ n such that 
\\£ n — < n~ 4 .) It then suffices to define 

Ve = \ x : lim £ n (x) exists f , 

and to set £(x) = lim^oo £ n (x) on Ve. □ 

Another very useful fact about the reproducing kernel space is given by: 

Proposition 3.37 The law of any element h* = t(h) 6 IZ^ is a centred Gaussian with variance 
||/i||^. Furthermore, any two elements h* , k* have covariance (h, k}^. 

Proof. We already know from the definition of a Gaussian measure that the law of any element of 
B* is a centred Gaussian. Let now h* be any element of TZ^ and let h n be a sequence in TZ^ n B* 
such that h n — ► h* in 7?.^. We can furthermore choose this approximating sequence such that 
||^n||7£ M = ||^*||7e M = 1 1 h 1 1 At , so that the law of each of the h n is equal to A^(0, ||/i||^). 

Since I? -convergence implies convergence in law, we conclude that the law of h* is also given 
by AA(0, The statement about the covariance then follows by polarisation, since 

Eh*k* = l -(E{h* + k*f - E(h*f - E(F) 2 ) = ±(\\h + fc||J - \\h\\l - \\k\\D = (h, fc)„ , 

by the previous statement. □ 

Remark 3.38 Actually, the converse of Proposition 3.36 is also true: if £: B — ► R is measurable 
and linear on a measurable linear subspace V of full measure, then £ belongs to IZ^. This is 
not an obvious statement. It can be viewed for example as a consequence of the highly non- 
trivial fact that every Borel measurable linear map between two sufficiently 'nice' topological 
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vector spaces is bounded, see for example [Sch66, Kat82]. (The point here is that the map must 
be linear on the whole space and not just on some "large" subspace as is usually the case with 
unbounded operators.) This implies by Proposition 3.42 that £ is a measurable linear extension 
of some bounded linear functional on Ti^. Since such extensions are unique (up to null sets) by 
Theorem 3.47 below, the claim follows from Proposition 3.31. 

Exercise 3.39 Show that if B C B is a continuously embedded Banach space with p(B) = 1, 
then the embedding B* <^-> extends to an embedding B* <^-> TZ^. Deduce from this that the 
restriction of p to B is again a Gaussian measure. In particular, Kolmogorov's continuity criterion 
yields a Gaussian measure on Cq ([0, l] d , R). 

The properties of the reproducing kernel space of a Gaussian measure allow us to give another 
illustration of the fact that measures on infinite-dimensional spaces behave in a rather different 
way from measures on R n : 

Proposition 3.40 Let pbe a centred Gaussian measure on a separable Banach space B such that 
dim = oo. Denote by D c the dilatation by a real number c on B, that is D c (x) = ex. Then, p 
and D*p are mutually singular for every c / ±1. 

Proof. Since the reproducing Kernel space IZ^ is a separable Hilbert space, we can find an or- 
thonormal basis {e n }„>o- Consider the sequence of random variables Xn(x) = J2n=i \ e n(x)\ 2 
over B. If B is equipped with the measure p then, since the e n are independent under p, we can 
apply the law of large numbers and deduce that 

lim X N (x) = 1 , (3.12) 

N— >oo 

for ^-almost every x. On the other hand, it follows from the linearity of the e n that when we equip 
B with the measure D*p, the e n are still independent, but have variance c 2 , so that 

lim Xn{x) = c 2 , 

for D^-almost every x. This shows that if c ^ ±1, the set on which the convergence (3.12) takes 
place must be of D* ^-measure 0, which implies that p and D*p are mutually singular. □ 

As already mentioned earlier, the importance of the Cameron-Martin space is that it represents 
precisely those directions in which one can translate the measure p without changing its null sets: 

Theorem 3.41 (Cameron-Martin) For h G B, define the map T^-.B — > B by T)j(x) = x + h. 
Then, the measure T£p is absolutely continuous with respect to p if and only ifh£ 7i^. 

Proof. Fix h 6 TL^ and let h* G L 2 (B, p) be the corresponding element of the reproducing kernel. 
Since the law of h* is Gaussian by Proposition 3.37, the map x i-> exp(h*(x)) is integrable. Since 
furthermore the variance of h* is given by the function 

V h {x) = zxv(h\x)-\\\h\\D (3.13) 

is strictly positive, belongs to L X {B, p), and integrates to 1. It is therefore the Radon-Nikodym 
derivative of a measure ph that is absolutely continuous with respect to p. To check that one 
has indeed ph = T£p, it suffices to show that their Fourier transforms coincide. Assuming that 
h* G B*, one has 

p h (£) = f exp(^Or) + h*(x) - ±\\h\\D p{dx) = exp(iC„(i£ + h*,i£ + h*) - ±\\h\\D 
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= exp(-^C^(£,£)-iC^£,h*)) = exp{-^(£,£) + i£(h)) ■ 

Using Proposition 3.37 for the joint law of £ and h*, it is an easy exercise to check that this equality 
still holds for arbitrary h G Ti^. 
On the other hand, we have 

f*jj,(£) = ( exp(i£(x))T£p(dx) = [ exp(i£(x + h)) n{dx) = e mh) [ exp(i£(x)) (j,(dx) 
Jb Jb Jb 

= exp(-±C lx (£,£) + i£(h)) , 

showing that p^ = Tj* p. 

To show the converse, note first that one can check by an explicit calculation that ||AA(0, 1) — 
N{h, 1)||tv > 2 - 2exp(-^). Fix now some arbitrary n > 0. If h then, by Exercise 3.35, 
there exists £ 6 B* with C^i, £) = 1 such that £(h) > n. Since the image £*p of p under £ is 
jV(0, 1) and the image of T£p under £ is Af(-£(h), 1), this shows that 

II/* - T h V||TV > IK> - ^%VI|tv = ||AA(0, 1) - J\f(-£(h), 1)||tv > 2 - 2exp(-— ) . 

Since this is true for every n, we conclude that \\p — T^||tv = 2, thus showing that they are 
mutually singular. □ 



As a consequence, we have the following characterisation of the Cameron-Martin space 



Proposition 3.42 The space TC^ C B is the intersection of all (measurable) linear subspaces of 
full measure. However, ifTL^ is infinite-dimensional, then one has p(Tt^) = 0. 

Proof. Take an arbitrary linear subspace V C B of full measure and take an arbitrary h € H^. It 
follows from Theorem 3.41 that the affine space V — h also has full measure. Since (V—h)C\V = 4> 
unless h £ V, one must have h G V, so that C f]{V C B : p(V) = 1}. 

Conversely, take an arbitrary x and let us construct a linear space V C Bof full measure, 
but not containing x. Since x 7^, one has ||x|| M = oo with || • || M extended to B as in (3.11). 
Therefore, we can find a sequence £ n 6 B* such that C^(£ ni £ n ) < 1 and £„(x) > n. Defining the 
norm \y\ 2 = J2 n n~ 2 {£ n {y)f, we see that 

f °° 1 f IT 2 

/ \y\ 2 p(dy) = E — / (tn(y)) 2 Kdy) < — , 

J B n=l ^ 

so that the linear space F = {y : |y| < oo} has full measure. However, \x\ = oo by construction, 
so that x V. 

To show that p{H^) = if dim 7^ = oo, consider an orthonormal sequence e n € so that 
the random variables {e n (x)} are i.i.d. M(0, 1) distributed. By the second Borel-Cantelli lemma, it 
follows that sup n |e n (x)| = oo for p-almost every x, so that in particular ||x||^ > J2 n e n( x ) = 00 
almost surely. □ 



Exercise 3.43 Recall that the (topological) support supp p of a Borel measure on a complete sep- 
arable metric space consists of those points x such that p(U) > for every neighbourhood U of 
x. Show that, if p is a Gaussian measure, then its support is the closure of in B. 
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3.3 Images of Gaussian measures 

It follows immediately from the definition of a Gaussian measure and the expression for its Fourier 
transform that if p is a Gaussian measure on some Banach space B and A: B — > £>2 is a bounded 
linear map for B2 some other Banach space, then v = A* \i is a Gaussian measure on B2 with 
covariance 

C U (£,£') = C^A*£,A*£') , 

where A*:B\ — ► B* is the adjoint to A, that is the operator such that (A*£)(x) = £(Ax) for every 
x e B and every £ G B\. 

Recall now that TL^ is the intersection over all linear subspaces of B that have full measure 
under /x. This suggests that in order to determine the image of \i under a linear map, it is sufficient 
to know how that map acts on elements of Ji^. This intuition is made precise by the following 
theorem: 



Theorem 3.44 Let \i be a centred Gaussian probability measure on a separable Banach space 
B. Let furthermore Jibe a separable Hilbert space and let A: Tin — > TL be a Hilbert-Schmidt 
operator. (That is AA*:Tt — > TL is trace class.) Then, there exists a measurable map A:B — > TL 
such that v = A* p is Gaussian with covariance C u (h,k) = {A*h, A*k) ^. Furthermore, there 
exists a measurable linear subspace V C B of full ^-measure such that A restricted to V is linear 
and A restricted to TL^ C V agrees with A. 

Proof. Let {e n } n >i be an orthonormal basis for TL^ and denote by e* the corresponding elements 
in C L 2 (B, p) and define Sn(x) = J2n=o e n( x )Ae n . Recall from Proposition 3.36 that we 
can find subspaces V en of full measure such that e* is linear on V Cn . Define now a linear subspace 
V C ,6 by 

V = jx G P| V en : the sequence {Sn(x)} converges in 7i| , 

n>0 

(the fact that V is linear follows from the linearity of each of the e* ) and set 

h x \ = { lim A^oo S N (x) for x e V, 
\ otherwise. 

Since the random variables {e* } are i.i.d. Af(0, l)-distributed under jjl, the sequence {Sn} forms 
an H-valued martingale and one has 

00 

sup E M \\S N (x)\\ 2 = \\ Ae n\\ 2 < tiA*A < 00 , 

N n=0 

where the last inequality is a consequence of A being Hilbert-Schmidt. It follows that p(V) = 1 
by Doob's martingale convergence theorem. 

To see that v = A* \x has the stated property, fix an arbitrary h G H and note that the series 
J2n>i e n(Ae n , h) converges in TZ^ to an element with covariance H^*/?-!! 2 . The statement then 
follows from Proposition 3.37 and the fact that C u (h, h) determines C v by polarisation. To check 
that v is Gaussian, we can compute its Fourier transform in a similar way. □ 

Remark 3.45 Similarly to Proposition 3.36, the converse is again true: if A: B — > H is a measur- 
able map which is linear on a measurable subspace of full measure and agrees with A on TL^, then 
it agrees /i-almost surely with the extension constructed in Theorem 3.44. 
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The proof of Theorem 3.44 can easily be extended to the case where the image space is a 
Banach space rather than a Hilbert space. However, in this case we cannot give a straightforward 
characterisation of those maps A that are 'admissible', since we have no good complete character- 
isation of covariance operators for Gaussian measures on Banach spaces. However, we can take 
the pragmatic approach and simply assume that the new covariance determines a Gaussian mea- 
sure on the target Banach space. With this approach, we can formulate the following version for 
Banach spaces: 

Proposition 3.46 Let B\ and B2 be two separable Banach space and let pbe a centred Gaussian 
probability measure on B\. Let A: TC^ — ► B2 be a bounded linear operator such that there exists 
a centred Gaussian measure v on B2 with covariance C u (h, k) = (A*h, A*/c) M . Then, there exists 
a measurable map A:B\ — > B2 such that v = A* p and such that there exists a measurable 
linear subspace V C B of full ^-measure such that A restricted to V is linear and A restricted to 
TL^ C V agrees with A. 

Proof. As a first step, we construct a Hilbert space H2 such that B2 C H2 as a Borel subset. 
Denote by TL V C B2 the Cameron-Martin space of v and let {e„} C TL V be an orthonormal basis 
of elements such that e* G B\ for every n. (Such an orthonormal basis can always be found by 
using the Grahm-Schmidt procedure.) We then define a norm on B2 by 



n>l 



E 



12 " ^ n*\ 



where ||e* || is the norm of e* in B\. It is immediate that ||x||2 < 00 for every x G B2, so that this 
turns £>2 into a pre-Hilbert space. We finally define H2 as the completion of B2 under || • H2. 

Denote by v' the image of the measure v under the inclusion map 1: B2 Ti-2- It follows 
that the map A' = 1 o A satisfies the assumptions of Theorem 3.44, so that there exists a map 
A: B\ — > TL2 which is linear on a subset of full /i-measure and such that A*p = v' . On the other 
hand, we know by construction that v'{Bi) = h so that the set {x : Ax G B2} is of full measure. 
Modifying A outside of this set by for example setting it to and using Exercise 3.39 then yields 
the required statement. □ 

3.3.1 Uniqueness of measurable extensions and the isoperimetric inequality 

This section is devoted to a proof of the converse of Theorem 3.44 and Proposition 3.46, namely 

Theorem 3.47 Let p, be a Gaussian measure on a separable Banach space B\ with Cameron- 
Martin space and let A:7i^ — > B2 be a linear map satisfying the assumptions of Proposi- 
tion 3.46. Then the linear measurable extension A of A is unique, up to sets of measure 0. 

Remark 3.48 As a consequence of this result, the precise Banach spaces B\ and B2 are com- 
pletely irrelevant when one considers the image of a Gaussian measure under a linear transforma- 
tion. The only thing that matters is the Cameron-Martin space for the starting measure and the 
way in which the linear transformation acts on this space. This fact will be used repeatedly in the 
sequel. 

This is probably one of the most remarkable results in Gaussian measure theory. At first sight, 
it appears completely counterintuitive: the Cameron-Martin space has measure 0, so how can 
the specification of a measurable map on a set of measure be sufficient to determine it on a set 
of measure 1? Part of the answer lies of course in the requirement that the extension A should be 
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linear on a set of full measure. However, even this requirement would not be sufficient by itself 
to determine A since the Hahn-Banach theorem provides a huge number of different extension of 
A that do not coincide anywhere except on H^. The missing ingredient that solves this mystery 
is the requirement that A is not just any linear map, but a measurable linear map. This additional 
constraint rules out all of the non-constructive extensions of A provided by the Hahn-Banach 
theorem and leaves only one (constructive) extension of A. 

The main ingredient in the proof of Theorem 3.47 is the Borell-Sudakov-Cirel'son inequality 
[SC74, Bor75], a general form of isoperimetric inequality for Gaussian measures which is very 
interesting and useful in its own right. In order to state this result, we first introduce the notation 
B £ for the H^-ball of radius e centred at the origin. We also denote by A + B the sum of two sets 
defined by 

A + B = {x + y: xeA, y e B} , 

and we denote by $ the distribution function of the normal Gaussian: <£(i) = e~ s2//2 ds. 

With these notations at hand, we have the following: 

Theorem 3.49 (Borell-Sudakov-Cirel'son) Let fj.be a Gaussian measure on a separable Banach 
space B with Cameron-Martin space TL^ and let A C B be a measurable subset with measure 
fx(A) = &(a)for some a £ R. Then, for every e > 0, one has the bound p,(A + B £ ) > <I>(a + e). 

Remark 3.50 Theorem 3.49 is remarkable since it implies that even though Ti^ itself has measure 
0, whenever A is a set of positive measure, no matter how small, the set A + has full measure! 

Remark 3.51 The bound given in Theorem 3.49 is sharp whenever A is a half space, in the sense 
that A = {x G B : l(x) > c} for some £ G TZ^ and c G R. In the case where e is small, 
{A + B £ ) \ A is a fattened boundary for the set A, so that fi(A + B e ) — p,{A) can be interpreted as 
a kind of 'perimeter' for A. The statement can then be interpreted as stating that in the context of 
Gaussian measures, half-spaces are the sets of given perimeter that have the largest measure. This 
justifies the statement that Theorem 3.49 is an isoperimetric inequality. 

We are not going to give a proof of Theorem 3.49 in these notes because this would lead us 
too far astray from our main object of study. The interested reader may want to look into the 
monograph [LT91] for a more exhaustive treatment of probability theory in Banach spaces in 
general and isoperimetric inequalities in particular. Let us nevertheless remark shortly on how the 
argument of the proof goes, as it can be found in the original papers [SC74, Bor75]. In a nutshell, 
it is a consequence of the two following remarks: 

• Let vm be the uniform measure on a sphere of radius \f~M in R M and let Hm,u be the 
orthogonal projection from R M to R™. Then, the sequence of measures II^a^m converges 
as M — > oo to the standard Gaussian measure on R ra . This remark is originally due to 
Poincare. 

• A claim similar similar to that of Theorem 3.49 holds for the uniform measure on the 
sphere, in the sense that the volume of a fattened set A + B £ on the sphere is bounded from 
below by the volume of a fattened 'cap' of volume identical to that of A. Originally, this 
fact was discovered by Levy, and it was then later generalised by Schmidt, see [Sch48] or 
the review article [Gar02]. 

These two facts can then be combined in order to show that half-spaces are optimal for finite- 
dimensional Gaussian measures. Finally, a clever approximation argument is used in order to 
generalise this statement to infinite-dimensional measures as well. 

An immediate corollary is given by the following type of zero-one law for Gaussian measures: 
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Corollary 3.52 Let V C B be a measurable linear subspace. Then, one has either fi{V) = or 

tiiV) = l. 

Proof. Let us first consider the case where <jt V. In this case, just as in the proof of Proposi- 
tion 3.42, we conclude that fi(V) = 0, for otherwise we could construct an uncountable collection 
of disjoint sets with positive measure. 

If Ji^ C V, then we have V + B £ = V for every e > 0, so that if fi(V) > 0, one must have 
fi(V) = 1 by Theorem 3.49. □ 

We have now all the necessary ingredients in place to be able to give a proof of Theorem 3.47: 

Proof of Theorem 3.47. Assume by contradiction that there exist two measurable extensions A\ 
and A2 of A. In other words, we have A; L x = Ax for x G and there exist measurable 
subspaces Vi with /J,(Vi) = 1 such that the restriction of Aj, to Vi is linear. Denote V = V\ PI V2 
and A = A2 — A\, so that A is linear on V and A|^ M = 0. 

Let I G B\ be arbitrary and consider the events Vf = {x : £(Ax) < c}. By the linearity 
of A, each of these events is invariant under translations in Ti^, so that by Theorem 3.49 we 
have /i(Vg c ) G {0, 1} for every choice of I and c. Furthermore, for fixed I, the map c ^ ^(Vf) 
is increasing and it follows from the cr-additivity of fi that we have lim c __ > _ 00 fi(Vf) = and 
limc^oo fi(V e c ) = 1. Therefore, there exists a unique q € R such that /J,(V[) jumps from to 1 
at c = C£. In particular, this implies that £(Ax) = q ^(-almost surely. However, the measure \i is 
invariant under the map x 1— > — x, so that we must have q = —C£, implying that eg = 0. Since 
this is true for every t 6 B%, we conclude from Proposition 3.6 that the law of Ax is given by the 
Dirac measure at 0, so that Ax = /[/-almost surely, which is precisely what we wanted. □ 

In the next section, we will see how we can take advantage of this fact to construct a the- 
ory of stochastic integration with respect to a "cylindrical Wiener process", which is the infinite- 
dimensional analogue of a standard n-dimensional Wiener process. 

3.4 Cylindrical Wiener processes and stochastic integration 

Central to the theory of stochastic PDEs is the notion of a cylindrical Wiener process. Recall that 
in general a stochastic process X taking values in a separable Banach space B is nothing but a 
collection {X(t)} of ^-valued random variables indexed by time t G R (or taking values in some 
subset of R). A notable special case which will be of interest here is the case where the probability 
space is taken to be for example Q = C([0, T],B) (or some other space of B- valued continuous 
functions) endowed with some Gaussian measure P and where the process X is given by 

X(t)(u) = w(t) , uj e n . 

In this case, X is called the canonical process on Q. 

Recall that the usual (one-dimensional) Wiener process is a real-valued centred Gaussian 
process B{t) such that B(0) = and E\B(t) — B(s)\ 2 = \t — s\ for any pair of times s,t. 
From our point of view, the Wiener process on any finite time interval / can always be re- 
alised as the canonical process for the Gaussian measure on C(7,R) with covariance function 
C(s, t) = s A t = min{s, t}. (Note that such a measure exists by Kolmogorov's continuity crite- 
rion.) 

Since the space C(R, R) is not a Banach space and we have not extended our study of Gaussian 
measures to Frechet spaces, we refrain from defining a measure on it. However, one can define 
Wiener measure on a separable Banach space of the type 

C,(R+,R) = {/ G C(R+,R) : lim f(t)/g(t) exists} , \\f\\ e = sup iffl , 

t^OO J teR Q(t) 
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for a suitable weight function g:K —> [l,oo). For example, we will see that g(t) = 1 + t 2 is 
suitable, and we will therefore define Cw = C e for this particular choice. 

Proposition 3.53 There exists a Gaussian measure p, on Cw with covariance function C(s,t) = 
s At. 

Proof. We use the fact that / € C([0, vr], R) if and only if the function T(f) given by T(f)(t) = 
(1 + i 2 )/(arctan t) belongs to Cw- Our aim is then to construct a Gaussian measure on 
C([0, 7r], R) which is such that T*po has the required covariance structure. 
The covariance Co for p,Q is then given by 

tan x A tan y 

C (x, y) = 



(1 + tan 2 x)(l + tan 2 y) ' 

It is now a straightforward exercise to check that this covariance function does indeed satisfy the 
assumption of Kolmogorov's continuity theorem. □ 

Let us now fix a (separable) Hilbert space H, as well as a larger Hilbert space H! containing 
H as a dense subset and such that the inclusion map t:7i —>■ H' is Hilbert-Schmidt. Given H, it is 
always possible to construct a space H' with this property: choose an orthonormal basis {e n } of 
H and take H' to be the closure of H under the norm 

oo ^ 

One can check that the map a* is then given by u*e n = -^e n , so that it is indeed trace class. 

Definition 3.54 Let H and H! be as above. We then call a cylindrical Wiener process on H any 
W-valued Gaussian process W such that 

E(h, W(s)) H ,{W(t), k) w = (s A t)(i*h, L*k) = (s A t){u*h, k) w , (3.14) 

for any two times s and t and any two elements h, k € H'. By Kolmogorov's continuity theorem, 
this can be realised as the canonical process for some Gaussian measure on Cw(R, H'). 

Alternatively, we could have defined the cylindrical Wiener process on H as the canonical 
process for any Gaussian measure with Cameron-Martin space H^' 2 ([0,T],H), see Exercise 3.27. 

Proposition 3.55 In the same setting as above, the Gaussian measure fi on Ti! with covariance 
u* has H as its Cameron-Martin space. Furthermore, ||/i|| 2 = ||/i|| 2 /or every h G Ti. 

Proof. It follows from the definition of that this is precisely the range of a* and that the map 
h i — ^ hT is given by h* = {a* )~ 1 h. In particular, is contained in the range of l. Therefore, for 
any h, k G Ti^, there exist h,geH such that h = ih and k = ik. Using this, we have 

(h,k)v = ((u*)h*,k*) n > = (h^u*)- 1 ^ = (tkiu*)' 1 ^ = (h,i*(u*rhk) = (h,k) , 
from which the claim follows. □ 
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The name 'cylindrical Wiener process on 7i' may sound confusing at first, since it is actually 
not an H-valued process. (A better terminology may have been 'cylindrical Wiener process over 
H\ but we choose to follow the convention that is found in the literature.) Note however that 
if h is an element in H that is in the range of t* (so that ih belongs to the range of u* and 
i*{u*y l ih = h), then 



In particular, if we just pretend for a moment that W(t) belongs to H for every t (which is of 
course not true!), then we get 



Here we used (3. 14) to go from the first to the second line. This shows that W(t) should be thought 
of as an H-valued random variable with covariance t times the identity operator (which is of course 
not trace class if H is infinite-dimensional, so that such an object cannot exist if dimW = oo). 
Combining Proposition 3.55 with Theorem 3.44, we see however that if K. is some Hilbert space 
and A: H — > K, is a Hilbert-Schmidt operator, then the /C-valued random variable AW(t) is well- 
defined. (Here we made an abuse of notation and also used the symbol A for the measurable 
extension of A to Ti' .) Furthermore, its law does not depend on the choice of the larger space H' . 

Example 3.56 (White noise) Recall that we informally defined 'white noise' as a Gaussian pro- 
cess £ with covariance E£(s)£(i) = 6(t — s). In particular, if we denote by (•, •) the scalar product 
in L 2 (R), this suggests that 



E<<7, (h, = E / / g(s)h(ms)m dsdt= / / g(s)h(t)S(t - s) ds dt = (g, h) . (3. 15) 



This calculation shows that white noise can be constructed as a Gaussian random variable on any 
space of distributions containing L 2 (R) and such the embedding is Hilbert-Schmidt. Furthermore, 
by Theorem 3.44, integrals of the form J g(s)^(s) ds are well-defined random variables, provided 
that g £ L 2 (R). Taking for g the indicator function of the interval [0, t], we can check that the 
process B(t) = Jq £(s) ds is a Brownian motion, thus justifying the statement that 'white noise is 
the derivative of Brownian motion' . 

The interesting fact about this construction is that we can use it to define space-time white 
noise in exactly the same way, simply replacing L 2 (R) by L 2 (R 2 ). 

This will allow us to define a Hilbert space-valued stochastic integral against a cylindrical 
Wiener process in pretty much the same way as what is usually done in finite dimensions. In the 
sequel, we fix a cylindrical Wiener process W on some Hilbert space H C H', which we realise 
as the canonical coordinate process on = Cvi/(R+, Ti!) equipped with the measure constructed 
above. We also denote by F s the cr-field on generated by {W r : r < s}. 

Consider now a finite collection of disjoint intervals (s n , t n ] C R+ with n = 1, . . . , N and 
a corresponding finite collection of F Sn -measurable random variables taking values in the 
space C2CH, KL) of Hilbert-Schmidt operators from TL into some other fixed Hilbert space /C. Let 
furthermore $ be the L 2 (R + x Q, C2(H, /C))-valued function defined by 



(h,k) = (t*(u*y 



1 th,k) = ((a*) 1 Lh,tk}')- i r . 



E{h, W(s))(W(t), k) = E((u*yhh, iW(s)) nl ((u*r l ik, iW{t)) H , 



= (s A fiiifiifyhh, (u*r 1 Lk) w 

= (sA t)(ih, (LL*)~ l Lk) n < = (s A t)(h, i*(u*r 1 ik) n > 

= (s A t)(h, k) . 





N 



n=l 
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where we denoted by 1a the indicator function of a set A. We call such a $ an elementary process 
on TL. 



Definition 3.57 Given an elementary process $ and a cylindrical Wiener process W on TL, we 
define the /C-valued stochastic integral 

roo N 

/ Ht) dW(t) ^ Yl ®n{W) (W(t n ) - W(s n )) . 

Note that since <£„ is F Sn -measurable, & n (W) is independent of W(t n ) — W(s n ), therefore each 
term on the right hand side can be interpreted in the sense of the construction of Theorems 3.44 
and 3.47. 



Remark 3.58 Thanks to Theorem 3.47, this construction is well-posed without requiring to spec- 
ify the larger Hilbert space TL' on which W can be realised as an W-valued process. This justifies 
the terminology of W being "the cylindrical Wiener process on TL" without any mentioning of TL' , 
since the value of stochastic integrals against W is independent of the choice of TL'. 

It follows from Theorem 3.44 and (3.6) that one has the identity 

N 



E 



/ $(t)dW(t) = Y,Vtr(®n(W)$* n (W))(t n -s n ) = E / tr <&(*)<&*(*) eft, (3.16) 
Jo ic ^ Jo 



which is an extension of the usual Ito isometry to the Hilbert space setting. It follows that the 
stochastic integral is an isometry from the subset of elementary processes in L 2 (R+ xf!, C2CH, 1Q) 
to L 2 (fl,lC). 

Let now F pr be the 'predictable' u-field, that is the u-field over R + x SI generated by all subsets 
of the form (s, t] x A with t > s and A 6 F s . This is the smallest u-algebra with respect to which 
all elementary processes are F pr -measurable. One furthermore has: 

Proposition 3.59 The set of elementary processes is dense in the space Lp r (R + x S7,£2(W,/C)) 
of all predictable C2CH, K.)-valued processes. 

Proof. Denote by F pr the set of all sets of the form (s, t] x A with A G F s . Denote furthermore 
by Lp r the closure of the set of elementary processes in L 2 . One can check that F pr is closed under 
intersections, so that 1q G i pr for every set G in the algebra generated by F pr . It follows from the 
monotone class theorem that 1q G L 2 r for every set G G F pr . The claim then follows from the 
definition of the Lebesgue integral, just as for the corresponding statement in R. □ 

By using the Ito isometry (3.16) and the completeness of L 2 (fi, AC), it follows that: 



Corollary 3.60 The stochastic integral Jq°° <3?(i) dW(t) can be uniquely defined for every process 
$e^(R + xl],£ 2 (H,/C)). 

This concludes our presentation of the basic properties of Gaussian measures on infinite- 
dimensional spaces. The next section deals with the other main ingredient to solving stochastic 
PDEs, which is the behaviour of deterministic linear PDEs. 
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4 A Primer on Semigroup Theory 

This section is strongly based on Davies's excellent monograph [Dav80] for the first part on 
strongly continuous semigroups and very loosely follows [Yos95] and [Lun95] for the second 
part on analytic semigroups. Another good reference on some of the material covered here is the 
monograph [Paz83]. Its aim is to give a rigorous meaning to solutions to linear equations of the 
type 

d t x = Lx , x(0) = x G B , (4.1) 

where x takes values in some Banach space B and L is a possibly unbounded operator on B. From 
a formal point of view, if such a solution exists, one expects the existence of a linear operator S(t) 
that maps the initial condition xq onto the solution x(t) of (4.1) at time t. If such a solution is 
unique, then the family of operators S(t) should satisfy 5(0) = 1 and S(t) o S(s) = S(t + s). This 
is called the semigroup property. 

Furthermore, such a family of solution operators S(t) should have some regularity as t — > 
in order to give a meaning to the notion of an initial condition. (The family given by S(t) = 
for t > and 5(0) = 1 does satisfy the semigroup property but clearly doesn't define a family of 
solution operators to an equation of the type (4.1).) 

This motivates the following definition: 

Definition 4.1 A semigroup S(t) on a Banach space B is a family of bounded linear operators 
{5(t)}t> with the properties that S(t) o 5(s) = S(t + s) for any s, t > and that 5(0) = Id. A 
semigroup is furthermore called 

• strongly continuous if the map (x, t) i— > S(t)x is strongly continuous. 

• analytic if there exists 6 > such that the operator-valued map t i— > S(t) has an analytic 
extension to {A G C : | argA| < 6}, satisfies the semigroup property there, and is such 
that 1 1 ^ S{e l(p t) is a strongly continuous semigroup for every angle 92 with \(p\ < 6. 

A strongly continuous semigroup is also sometimes called a Co-semigroup. 

Exercise 4.2 Show that being strongly continuous is equivalent to t S(t)x being continuous at 
t = for every x G £> and the operator norm of S(t) being bounded by Me at for some constants 
M and a. Show then that the first condition can be relaxed to t w S(t)x being continuous for 
all x in some dense subset of B. (However, the second condition cannot be relaxed in general. 
See Exercise 5.19 on how to construct a semigroup of bounded operators such that ||5(t)|| is 
unbounded near t = 0.) 

Remark 4.3 Some authors, like [Lun95], do not impose strong continuity in the definition of an 
analytic semigroup. This can result in additional technical complications due to the fact that the 
generator may then not have dense domain. The approach followed here has the slight drawback 
that with our definitions the heat semigroup is not analytic on L°°(R). (It lacks strong continuity 
as can be seen by applying it to a step function.) It is however analytic for example on Co(R), the 
space of continuous functions vanishing at infinity. 

This section is going to assume some familiarity with functional analysis. All the necessary 
results can be found for example in the classical monograph by Yosida [Yos95]. Recall that an 
unbounded operator L on a Banach space B consists of a linear subspace V(L) C B called the 
domain of L and a linear map L: V{L) — > B. The graph of an operator is the subset of B x B 
consisting of all elements of the form (x, Lx) with x G T>(L). An operator is closed if its graph is 
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a closed subspace of B x B. It is closable if the closure of its graph is again the graph of a linear 
operator and that operator is called the closure of L. 

The domain V(L*) of the adjoint L* of an unbounded operator L: V(L) — > B is defined as 
the set of all elements I <E B* such that there exists an element L*£ G B* with the property that 
(L*£)(x) = £(Lx) for every x G V(L). It is clear that in order for the adjoint to be well-defined, 
we have to require that the domain of L is dense in B. Fortunately, this will be the case for all the 
operators that will be considered in these notes. 

Exercise 4.4 Show that L being closed is equivalent to the fact that if {x n } C V(L) is Cauchy in 
B and {Lx n } is also Cauchy, then x = lim^oo x n belongs to V{L) and Lx = lim^oo Lx n . 

Exercise 4.5 Show that the adjoint of an operator with dense domain is always closed. 

The resolvent set g(L) of an operator L is defined by 

q(L) = {A G C : range(A — L) is dense in B and A — L has a continuous inverse.} , 

and the resolvent R\ is given for A G g(L) by R\ = (A — L) -1 . (Here and in the sequel we view 
Sasa complex Banach space. If an operator is defined on a real Banach space, it can always be 
extended to its complexification in a canonical way and we will identify the two without further 
notice in the sequel.) The spectrum of L is the complement of the resolvent set. 

The most important results regarding the resolvent of an operator that we are going to use are 
that any closed operator L with non-empty resolvent set is defined in a unique way by its resolvent. 
Furthermore, the resolvent set is open and the resolvent is an analytic function from g(L) to the 
space C(B) of bounded linear operators on B. Finally, the resolvent operators for different values 
of A all commute and satisfy the resolvent identity 

R\ - = — ^)RfiR\ . 

for any two A, /j, G q{L). 

The fact that the resolvent is operator- valued should not be a conceptual obstacle to the use of 
notions from complex analysis. Indeed, for D C C an open domain, a function f:D—>B where 
B is any complex Banach space (typically the complexification of a real Banach space which we 
identify with the original space without further ado) is said to be analytic in exactly the same way 
as usual by imposing that its Taylor series at any point a G D converges to / uniformly in B 
on a neighbourhood of a. The same definition applies if D C R and analytic continuation then 
works in exactly the same way as for complex-valued functions. In particular, Cauchy's residue 
theorem, which is the main result from complex analysis that we are going to use later on, works 
for Banach-space valued functions in exactly the same way as for complex- valued functions. 

4.1 Strongly continuous semigroups 

We start our investigation of semigroup theory with a discussion of the main results that can be 
obtained for strongly continuous semigroups. Given a Co -semigroup, one can associate to it a 
'generator', which is essentially the derivative of S(t) at t = 0: 

Definition 4.6 The generator L of a Co-semigroup is given by 

Lx = lim j t-\S(t)x - x) , (4.2) 

on the set V{L) of all elements x G B such that this limit exists (in the sense of strong convergence 
in B). 
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The following result shows that if L is the generator of a Co-semigroup S(t), then x(t) = 
S(t)xo is indeed the solution to (4. 1) in a weak sense. 

Proposition 4.7 The domain V{L) of L is dense in B, invariant under S, and the identities 
dfS(t)x = LS(t)x = S(t)Lx hold for every x £ T){L) and every t > 0. Furthermore, for 
every I £ V(L*) and every x £ B, the map t ^ (£, S(t)x) is differentiable and one has 
d t (£,S(t)x) = (L*£,S(t)x). 

Proof. Fix some arbitrary x £ B and set xt = J* * S{s)x ds. One then has 

lim h~ l (S{h)x t - x t ) = lim h^i / S{s)xds - / S{s)xds) 
h^O h^O \J h Jo ) 

= XxmhT l {\ S(s)x ds - f S(s)x ds) = S(t)x - x , 

h^O \Jt Jo ' 

where the last equality follows from the strong continuity of S. This shows that x t £ V(L). Since 
t~ 1 x t — > x as t —>■ and since x was arbitrary, it follows that V(L) is dense in B. To show that it 
is invariant under S, note that for x £ V(L) one has 

lim h^iSifySMx - S(t)x) = S(t) lim h- l {S{h)x - x) = S(t)Lx , 

h— »0 h— >0 

so that S(t)x £ V(L) and LS(t)x = S(t)Lx. To show that it this is equal to dtS(t)x, it suffices to 
check that the left derivative of this expression exists and is equal to the right derivative. This is 
left as an exercise. 

To show that the second claim holds, it is sufficient (using the strong continuity of S) to check 
that it holds for x £ V(L). Since one then has S(t)x £ V(L) for every t, it follows from the 
definition (4.2) of V(L) that t S(t)x is differentiable and that its derivative is equal to LS(t)x. 

□ 

It follows as a corollary that no two semigroups can have the same generator (unless the semi- 
groups coincide of course), which justifies the notation S(t) = e Lt that we are occasionally going 
to use in the sequel. 

Corollary 4.8 If a function x: [0, 1] — > V{L) satisfies dtXt = Lxt for every t £ [0,1], then 
Xt = S(t)xo- In particular, no two distinct Co-semigroups can have the same generator. 

Proof. It follows from an argument almost identical to that given in the proof of Proposition 4.7 
that the map t ^ S(t)xT-t is continuous on [0, T] and differentiable on (0, T). Computing its 
derivative, we obtain dtS(t)xT-t = LS(t)xT-t — S(t)Lxx-t = 0, so that xt = S(T)xq. □ 

Exercise 4.9 Show that the semigroup S(t) on L 2 (R) given by 

(smxo = m + t) , 

is strongly continuous and that its generator is given by L = dg. with V(L) = H 1 . Similarly, show 
that the heat semigroup on L 2 (R) given by 



W/)(0 = ^= ! /exp(-K_!f) /(??) ^, 



is strongly continuous and that its generator is given by L = 9| with V(L) = H 2 . Hint: Use 
Exercise 4.2 to show strong continuity. 
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Remark 4.10 We did not make any assumption on the structure of the Banach space B. However, 
it is a general rule of thumb (although this is not a theorem) that semigroups on non-separable 
Banach spaces tend not to be strongly continuous. For example, neither the heat semigroup nor 
the translation semigroup from the previous exercise are strongly continuous on L°°(R) or even 
on Cfe(R), the space of all bounded continuous functions. 

Recall now that the resolvent set for an operator L consists of those A G C such that the 
operator A — L is one to one. For A in the resolvent set, we denote by R\ = (A — L) _1 the 
resolvent of L. It turns out that the resolvent of the generator of a Co-semigroup can easily be 
computed: 

Proposition 4.11 Let S(t) be a Co-semigroup such that \\S{t)\\ < Me at for some constants M 
and a. If ReA > a, then A belongs to the resolvent set of L and one has the identity R\x = 
/ °° e- xt S(t)x dt. 

Proof. By the assumption on the bound on S, the expression Z\ = f °° e~ xt S(t)x dt is well- 
defined for every A with ReA > a. In order to show that Z\ = R\, we first show that Z\x G V(L) 
for every ieS and that (A — L)Z\x = x. We have 

/•oo 

LZ x x = lim h,- l (S{h)Z x x - Z x x) = lim hr 1 \ e - xt (S(t + h)x - S(t)x) dt 

h— >0 h— >0 Jo 

,pA/i i poo „\h . 

= limf / e~ M S(t)xdt- — / e- xt S(t)xdt) 

h^o\ h Jo h Jo ' 

= \Z x x — x , 

which is the required identity. To conclude, it remains to show that A — L is an injection on V(L). 
If it was not, we could find x G T>(L) \ {0} such that Lx = Ax. Setting xt = e xt x and applying 
Corollary 4.8, this yields S(t)x = e xt x, thus contradicting the bound \\S(t)\\ < Me at if ReA > a. 

□ 

We can deduce from this that: 

Proposition 4.12 The generator L of a Co-semigroup is a closed operator. 

Proof. We are going to use the characterisation of closed operators given in Exercise 4.4. Shifting 
L by a constant if necessary (which does not affect it being closed or not), we can assume that 
a = 0. Take now a sequence x n G V{L) such that {x n } and {Lx n } are both Cauchy in B and set 
x = lim^oo x n and y = lim^oo Lx n . Setting z n = (1 - L)x n , we have lim n ^oo z n = x-y. 
On the other hand, we know that 1 belongs to the resolvent set, so that 

x = lim x n = lim R\z n = R\(x — y) . 

n— »oo n— »oo 

By the definition of the resolvent, this implies that x G V(L) and that x — Lx = x — y, so that 
Lx = y as required. □ 

We are now ready to give a full characterisation of the generators of Co-semigroups. This is 
the content of the following theorem: 

Theorem 4.13 (Hille-Yosida) A closed densely defined operator L on the Banach space B is the 
generator of a Co-semigroup S(t) with \\S(t)\\ < Me at if and only if all A with ReA > a lie in its 
resolvent set and the bound < M (ReA — a)~ n holds therefor every n > 1. 
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Proof. The generator L of a Co-semigroup is closed by Proposition 4.12. The fact that its resolvent 
satisfies the stated bound follows immediately from the fact that 

B%x= ■■■ e- X{tl+ - +tn) S(h + ... + t n )x dh • • • dt n 
Jo Jo 

by Proposition 4.1 1. 

To show that the converse also holds, we are going to construct the semigroup S(t) by using 
the so-called 'Yosida approximations' L x = XLR X for L. Note first that limA^oo LR\x = for 
every x G B: it obviously holds for x G T>(L) since then = 1 1 -R A La: || < II-RaIIII^II < 

M(ReA - a) _1 ||La:||. Furthermore, ||Li? A a:|| = \\XR\X - x\\ < (MA(A - a)" 1 + l)||x|| < 
(M + 2) || x || for A large enough, so that limA^oo LR x x = for every x by a standard density 
argument. 

Using this fact, we can show that the Yosida approximation of L does indeed approximate L 
in the sense that limA^oo L x x = Lx for every x G V(L). Fixing an arbitrary x G V{L), we have 

lim ||L A x - Lx\\ = lim ||(Ai? A - l)Lx\\ = lim ||Li? A La:|| = . (4.3) 

A^oo A— »oo A— »oo 

T + t n L n 

Define now a family of bounded operators S\(t) by S\(t) = e x = J2n>o ~nT- series 
converges in the operator norm since L\ is bounded and one can easily check that S\ is indeed a 
Co-semigroup (actually a group) with generator L\. Since L\ = —A + \ 2 R\, one has for A > a 
the bound 

\\Sx(t)\\ = ^ E tnX2n \ RnJ = Mexp(-At + = Mexp(-^) , (4.4) 

~i n\ V A — a' VA — a> 

n>0 

so that lim sup A ^. 0C ||<S A (t)|| < Me"'. Let us show next that the limit lim^oo S\(t)x exists for 
every t > and every x G B. Fixing A and fi large enough so that max{||S , A (t)||, ||<S^(£)||} < 
Me 2at , and fixing some arbitrary t > 0, we have for s G [0, t] 

\\d s S x (t - s)S lx (s)x\\ = \\S x (t - s)(L M - L x )S^s)x\\ = \\S x (t - s)S lx (s)(L lx - L x )x\\ 
<M 2 e 2at \\(L^-L x )x\\ . 

Integrating this bound between and t, we obtain 

\\S x (t)x - S^t)x\\ < M 2 te 2at \\L^x - L x x\\ , (4.5) 

which converges to for every x G T>(L) as A, \i —> oo since one then has L x x — > Lx. We can 
therefore <fe/wi<? a family of linear operators S*(t) by S(t)x = limA^oo S x (t)x. 

It is clear from (4.4) that \\S(t)\\ < Me at and it follows from the semigroup property of S x that 
S(s)S(t) = S(s + t). Furthermore, it follows from (4.5) and (4.3) that for every fixed x G V{L), 
the convergence S x (t)x — ► 5(t)x is uniform in bounded intervals of t, so that the map t h-> S(t)x 
is continuous. Combining this wit our a priori bounds on the operator norm of S(t), it follows 
from Exercise 4.2 that S is indeed a Co-semigroup. It remains to show that the generator L of S 
coincides with L. Taking first the limit A — > oo and then the limit t — > in the identity 

t^iSxiQx - x) = r 1 [ S x (s)L x xds , 
Jo 



we see that x G V(L) implies x G V(L) and Lx = Lx, so that L is an extension of L. However, 
for A > a, both A — L and A — L are one-to-one between their domain and B, so that they must 
coincide. □ 
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One might think that the resolvent bound in the Hille-Yosida theorem is a consequence of the 
fact that the spectrum of L is assumed to be contained in the half plane {A : Re A < a}. This 
however isn't the case, as can be seen by the following example: 

Example 4.14 We take B = n>1 C 2 (equipped with the usual Euclidean norms) and we define 
L = n>1 L n , where L n :C 2 — ► C 2 is given by the matrix 

in n 
in 

In particular, the resolvent of L n is given by 



R (n) 1 /' A - " ' » 



A (A - in) 2 V A - in 
so that one has the upper and lower bounds 



< ||i?V"ll < . ,, + v2 



|A — in\ 2 |A — in\ 2 |A — in\ 

Note now that the resolvent R\ of L satisfies ||i?A|| = su P n >i II-^a II- On one nan d, this shows 
that the spectrum of L is given by the set {in 2 : n > 1}, so that it does indeed lie in a half plane. 
On the other hand, for every fixed value a > 0, we have ||i? a +m|| > ^z, so that the resolvent 
bound of the Hille-Yosida theorem is certainly not satisfied. 

It is therefore not surprising that L does not generate a Co-semigroup on B. Even worse, 
trying to define S(t) = ® n >iS n (t) with S n (t) = e Lnt results in || SViWH > nt, so that S(t) is an 
unbounded operator for every t > 0! 



4.1.1 Adjoint semigroups 

It will be very useful in the sequel to have a good understanding of the behaviour of the adjoints 
of strongly continuous semigroups. The reason why this is not a completely trivial topic is that, in 
general, it is simply not true that the adjoint semigroup S*(t): B* —> B* of a strongly continuous 
semigroup is again strongly continuous. This is probably best illustrated by an example. 

Take B = C([0, 1], R) and let S(t) be the heat semigroup (with Neumann boundary conditions, 
say). Then S*(t) acts on finite signed measures by convolving them with the heat kernel. While it 
is true that S*(t)u — > u weakly as t —> 0, it is not true in general that this convergence is strong. 
For example, S*(t)S x does not converge to S x in the total variation norm (which is the dual to the 
supremum norm on C([0, 1], R)). However, this difficulty can always be overcome by restricting 
S*(t) to a slightly smaller space than B*. This is the content of the following result: 

Proposition 4.15 If S(t) is a Co-semigroup on B, then S*{t) is a Co-semigroup on the closure B^ 
ofV(L*) in B* and its generator is given by the restriction of L* to the set T>(L^) = {x G 
V(L*) : L*x G 6+}. 

Proof. We first show that S*(t) is strongly continuous on B^ and we will then identify its generator. 
Note first that it follows from Proposition 4.7 that S*(t) maps V(L*) into itself, so that it does 
indeed define a family of bounded operators on BK Since the norm of S*(t) is 0(1) as t — ► and 
since V(L*) is dense in B^ by definition, it is sufficient to show that lim^o S*(t)x = x for every 
x G V(L*). It follows immediately from Proposition 4.7 that for x G V(L*) one has the identity 

S*(t)x-x= /* S*(s)L*xds , 
Jo 
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from which we conclude that S*(t)x — > x. 

It follows from Proposition 4.11 that the resolvent R\ for S*(t) on B^ is nothing but the 
restriction of R* x to B\ This immediately implies that THjJ) is given by the stated expression. □ 

Remark 4.16 As we saw in the example of the heat semigroup, B^ is in general strictly smaller 
than B*. This fact was first pointed out by Phillips in [Phi55]. In our example, B* consists of 
all finite signed Borel measures on [0, 1], whereas B^ only consists of those measures that have a 
density with respect to Lebesgue measure. 

Even though B^ is in general a proper closed subspace of 23*, it is large enough to be dense in 
£>*, when equipped with the (much weaker) weak-* topology. This the content of our last result in 
the theory of strongly continuous semigroups: 

Proposition 4.17 For every £ G B* there exists a sequence £ n G B^ such that t n (x) — ► t{x) for 
every x G B. 

Proof. It suffices to choose i n = nR* n t. Since we have £ n G V{L*), it is clear that i n 6 B^ . On 
the other hand, we know from the proof of the Hille-Yosida theorem that linin^oo \\nR n x— x\\ = 
for every x G B, from which the claim follows at once. □ 

4.2 Semigroups with selfadjoint generators 

In this section, we consider the particular case of strongly continuous semigroups consisting of 
self-adjoint operators on a Hilbert space TL. The reason why this is an interesting case is that it 
immediately implies very strong smoothing properties of the operators S(t) in the sense that for 
every t > 0, they map TL into the domain of arbitrarily high powers of L. Furthermore, it is 
very easy to obtain explicit bounds on the norm of S(t) as an operator from TL into one of these 
domains. We will then see later in Section 4.3 on analytic semigroups that most of these properties 
still hold true for a much larger class of semigroups. 

Let L be a selfadjoint operator on TL which is bounded from above. Without loss of generality, 
we are going to assume that it is actually negative definite, so that (x, Lx) < for any x G TL. 
In this case, we can use functional calculus (see for example [RS80], in particular chapter VIII in 
volume I) to define selfadjoint operators f(L) for any measurable map /: R — > R. This is because 
the spectral decomposition theorem can be formulated as: 

Theorem 4.18 (Spectral decomposition) Let L be a selfadjoint operator on a separable Hilbert 
space TL. Then, there exists a measure space an isomorphism K:TL — > L 2 (M,n), and 

a function /l'M K such that via K, L is equivalent to the multiplication operator by Jl on 
L 2 (M,fi). In other words, one has L = K~ x f L K and KV(L) = {g : f L g G L 2 (M, fi)}. 

In particular, this allows one to define f(L) = K~ 1 (f o f£)K, which has all the nice prop- 
erties that one would expect from functional calculus, like for example (fg)(L) = f(L)g(L), 
\\f(L)\\ = \\f\\L°°(M,iJ.)> etc - Defining S(t) = e Lt , it is an exercise to check that S is indeed a 
Co-semigroup with generator L (either use the Hille-Yosida theorem and make sure that the semi- 
group constructed there coincides with 5 or check 'by hand' that S(t) is indeed Co with generator 
L). 

The important property of semigroups generated by selfadjoint operators is that they do not 
only leave V(L) invariant, but they have a regularising effect in that they map H into the domain 
of any arbitrarily high power of L. More precisely, one has: 
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Proposition 4.19 Let L be self-adjoint and negative definite and let S(t) be the semigroup on H 
generated by L. Then, S(t) maps 7i into the domain of (I — L) a for any a,t > and there exist 
constants C a such that ||(1 - L) a S(t)\\ < C a (l + t~ a ). 

Proof. By functional calculus, it suffices to show that sup A>0 (l + \) a e~ xt < C a (l + t~ a ). One 
has 

sup \ a e~ xt = t~ a sup (Xt) a e' xt = t~ a sup X a e~ x = a a e- a t~ a . 

A>0 A>0 A>0 

The claim now follows from the fact that there exists a constant C' a such that (1 — X) a < C' a (l + 
(-X) a ) for every A < 0. □ 

4.3 Analytic semigroups 

Obviously, the conclusion of Proposition 4.19 does not hold for arbitrary Co-semigroups since the 
group of translations from Example 4.9 does not have any smoothing properties. It does however 
hold for a very large class of semigroups, the so-called analytic semigroups. The study of these 
semigroups is the object of the remainder of this section, and the equivalent of Proposition 4. 19 
is going to be one of our two main results. The other result is a characterisation of generators 
for analytic semigroups that is analogous to the Hille-Yosida theorem for Co-semigroups. The 
difference will be that the role of the half-plane ReA > a will be played by the complement of a 
sector of the complex plane with an opening angle strictly smaller than ir. 

Recall that a semigroup S on a Banach space B is analytic if there exists 9 > such that the 
map t i — ^ S(t) (taking values in C(B)) admits an analytic extension to the sector S$ = {A G C : 
| argA| < 9}, satisfies the semigroup property there, and is such that t ^ Sy(t) = S(e llf t) is 
a strongly continuous semigroup for every \tp\ < 9. If 9 is the largest angle such that the above 
property holds, we call S analytic with angle 9. The strong continuity of 1 1— > S{e %ip t) implies that 
there exist constants M(ip) and a(tp) such that 

113,0)11 < M(^" . 

Using the semigroup property, it is not difficult to show that M and a can be chosen bounded over 
compact intervals: 

Proposition 4.20 Let S be an analytic semigroup with angle 9. Then, for every 9' < 9, there exist 
M and a such that \\S,p(t)\\ < Me at for every t > and every \tp\ < 9'. 

Proof. Fix 9' G (0, 9), so that in particular 9 < tt/2. Then there exists a constant C such that, 
for every t > and every (p with \ip\ < 9', there exist numbers t+, t- G [0, Ct] such that te lip = 
t + e ie ' + t_e- ie '. It follows that one has the bound ||^(t)|| < M(9')M(-9')e a ^"> ct+a( - e ' )Ct , 
thus proving the claim. □ 

We next compute the generators of the semigroups obtained by evaluating S along a 'ray' 
extending out of the origin into the complex plane: 

Proposition 4.21 Let S be an analytic semigroup with angle 9. Then, for \<p\ < 9, the generator 
L v of Sp is given by L v = e llp L, where L is the generator of S. 

Proof. Recall Proposition 4. 1 1 showing that for ReA large enough the resolvent R\ for L is given 
by 

roc 

R x x = / e - xt S(t)xdt . 
Jo 
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Since the map t^e xt S(t) is analytic in Sg by assumption and since, provided again that ReA is 



large enough, it decays exponentially to as \t\ 
to obtain 



R\x 



— > oo, we can deform the contour of integration 

Xein S(e iv t)xdt . 



Denoting by R^ the resolvent for the generator L v of S^, we thus have the identity R\ 



e lip R^ el[p , which is equivalent to (A — L)~ 
stated. 



(A - e -i ^L„) _1 , thus showing that = e^L as 



□ 



We now use this to show that if S is an analytic semigroup, then the resolvent set of its gener- 
ator L not only contains the right half plane, but it contains a larger sector of the complex plane. 
Furthermore, this characterises the generators of analytic semigroups, providing a statement simi- 
lar to the Hille-Yosida theorem: 

Theorem 4.22 A closed densely defined operator L on a Banach space B is the generator of an 
analytic semigroup if and only if there exists 9 G (0, ^) and a > such that the spectrum of L is 
contained in the sector 

S e , a = {A G C : arg(a - A) G [-§ + 9, § - 9]} , 

and there exists M > such that the resolvent R\ satisfies the bound \\R\\\ < Md(X, Sq^)~ 1 for 
every A G" S e , a - 

Proof. The fact that generators of analytic semigroups are of the prescribed form is a consequence 
of Proposition 4.21 and the Hille-Yosida theorem. 

To show the converse statement, let L be such an 
operator, let ip G (0, 9), let b > a, and let 7^ be 
the curve in the complex plane obtained by going in v 
a counterclockwise way around the boundary of S^j, v N 

(see the figure on the right). For t with | argt| < ip, N v 

define S(t) by 



S(t) 



1 

1 

2vri 



e tz R 7 dz 



(4.6) 



\z - Ly 1 dz 




It follows from the resolvent bound that ||i? 2 || is uni- y 
formly bounded for z G 7^. Furthermore, since ^ ' 
I argt| < if, it follows that e tz decays exponentially 
as \ z\ —> 00 along 7^5, so that this expression is well- 
defined, does not depend on the choice of b and ip, 

and (by choosing 99 arbitrarily close to 9) determines an analytic function t S(t) on the sector 
{t : I argt| < 9}. As in the proof of the Hille-Yosida theorem, the function (x,t) i-> S(t)x is 
jointly continuous because the convergence of the integral defining S is uniform over bounded 
subsets of {t : \ argt| < ip} for any \ip\ < 9. 

It therefore remains to show that S satisfies the semigroup property on the sector {t : | arg t \ < 
9} and that its generator is indeed given by L. Choosing s and t such that | args| < 9 and 
I argi| < 9 and using the resolvent identity R z — R z < = (z' — z)R z R z i, we have 



S(s)S(t) = - 



1 

4^2 



e tz+sz R Z R Z , dzdz' = 



IV, 6' J l V ,b 



1 

"4^2 



tz+sz' 



R z — Rz 



dz dz' 
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If r e sz ' If f e tz ' 

= -T-?/ etZR z / ~, dz'dz-—, e sz R z dz'dz. 

4tH J^ b \ M z'-z 4vr 2 J lv b , J lv b z> -z 

Here, the choice of b and b' is arbitrary, as long as b / b' so that the inner integrals are well- 
defined, say b' > b for definiteness. In this case, since the contour 7^ can be 'closed up' to the 

sz' 

left but not to the right, the integral J 7 dz' is equal to 2iite sz for every z G 7^, whereas 

the integral with b and 6' inverted vanishes, so that 

S(s)S(t) = —7 f e« +s)z R z = S(s + t) , 

as required. The continuity of the map t ^ S(t)x is a straightforward consequence of the resolvent 
bound, noting that it arises as a uniform limit of continuous functions. Therefore S is a strongly 
continuous semigroup; let us call its generator L and R\ the corresponding resolvent. 

To show that L = L, it suffices to show that R\ = R\, so we make use again of Proposi- 
tion 4.11. Choosing ReA > b so that Re(z — A) < for every z G 7^, we have 

/•oo 1 /*oo r 

R x = / e - xt S(t) dt = — : / / e t(z ~ X) R z dz dt 

JO 27Ti Jo J llfi b 

= ^-f 'e«*-»dtR z dz = ± I -^-dz = R x . 

2-ki J^ b Jo 2-ki J^ b z - A 

The last inequality was obtained by using the fact that ||i? z || decays like l/\z\ for large enough z 
with I arg z\ < | + 93, so that the contour can be 'closed' to enclose the pole at z = A. □ 

As a consequence of this characterisation theorem, we can study perturbations of generators 
of analytic semigroups. The idea is to give a constructive criterion which allows to make sure that 
an operator of the type L = Lq + B is the generator of an analytic semigroup, provided that Lq 
is such a generator and B satisfies a type of 'relative total boundedness' condition. The precise 
statement of this result is: 



Theorem 4.23 Let Lq be the generator of an analytic semigroup and let B:V{B) — > B be an 
operator such that 

• The domain V(B) contains V(L ). 

• For every e > there exists C > such that \\Bx\\ < £\\Lox\\ + C\\x\\ for every x G V{Lq). 

Then the operator L = L + B (with domain V(L) = V(L )) is also the generator of an analytic 
semigroup. 

Proof. In view of Theorem 4.22 it suffices to show that there exists a sector Sg^ a containing the 
spectrum of L and such that the resolvent bound R\ < Md(X, Sg^)" 1 holds away from it. 
Denote by the resolvent for Lq and consider the resolvent equation for L: 

(X-L -B)x = y, x£V(L ). 

Since (at least for A outside of some sector) x belongs to the range of R x , we can set x = R\z so 
that this equation is equivalent to 

z - BR° x z = y . 
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The claim therefore follows if we can show that there exists a sector Sq a and a constant c < 1 
such that \\BR X \\ < c for A Se, a - This is because one then has the bound 

\\Rxy\\ = \\RU\ < 



l-c""" 

Using our assumption on B, we have the bound 

\\BR° x z\\ < e\\L R° x z\\ + C\\R° x z\\ . (4.7) 

Furthermore, one has the identity L$R Q X = XR X — 1 and, since Lq is the generator of an analytic 
semigroup by assumption, the resolvent bound \\R X \\ < Md(X, Sa^ 1 for some parameters a, b. 
Inserting this into (4.7), we obtain the bound 

||^°||< (£|A|+C)M I c 
d(X, S aj b) 

Note now that by choosing 9 G (0, a), we can find some 5 > such that d(X, S a ^) > <5 1 A | for all 
A Sg A and all a > 1 V (b + 1). We fix such a # and we make e sufficiently small such that one 
has both e < 1/4 and eJ -1 < 1/4. 

We can then make a large enough so that d(X, S a ,b) > 4CM for A S^a, so that ||-B-R°II < 
3/4. for these values of A, as requested. □ 

Remark 4.24 As one can see from the proof, one actually needs the bound \\Bx\\ < e||Lox|| + 
C||x|| only for some particular value of e that depends on the characteristics of Lq. 

As a consequence, we have: 

Proposition 4.25 Let f G L°°(R). Then, the operator 

(Lg)(x) = g"(x) + f(x)g'(x) , 

on L 2 (K) with domain V{L) = H 2 is the generator of an analytic semigroup. 

Proof. It is well-known that the operator (Log)(x) = g"(x) with domain V(L) = H 2 is self- 
adjoint and negative definite, so that it is the generator of an analytic semigroup with angle 9 = 

vr/2. 

Setting Bg = fg', we have for g G H 2 the bound 

\\Bgf = I f\x){g'{x)fdx < H/Hioo <</,</) = -H/HiocW) < ||/|| L ~||<7||||Lo<7|| ■ 
JR 

It now suffices to use the fact that 2\xy\ < ex 2 + s~ l y 2 to conclude that the assumptions of 
Theorem 4.23 are satisfied. □ 

Similarly, one can show: 

Exercise 4.26 Show that the generator of an elliptic diffusion with smooth coefficients on a com- 
pact Riemannian manifold M. generates an analytic semigroup on L 2 {M, q), where g is the vol- 
ume measure given by the Riemannian structure. 
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4.4 Interpolation spaces 

The remainder of this section will be devoted to the study of the domains of fractional powers 
of the generator L of an analytic semigroup S(t). For simplicity, we will assume throughout this 
section that there exist M > and w > such that < Me~ wt , thus making sure that 

the resolvent set of L contains all the right half of the complex plane. The general case can be 
recovered easily by 'shifting the generator to the left'. For a > 0, we define negative fractional 
powers of L by 

1 f°° 

( _ L) -a« / e -l S (t)dt, (4.8) 

F(a) Jo 

which is a bounded operator by the decay assumption on ||£(i)||. Since T(l) = 1, note that if a = 1 
one does indeed recover the resolvent of L evaluated at 0. Furthermore, it is straightforward to 
check that one has the identity (— L)~ a (— L)~^ = (—L)" 01 " 13 , which together justify the definition 
(4.8). 

Note that it follows from this identity that (— L)~ a is injective for every a > 0. Indeed, given 
some a > 0, one can find an integer n > such that (— L)~ n = (—L)~ n+a (—L)~ a . A failure 
for (— L)~ a to be injective would therefore result in a failure for (—L) n and therefore (— L)^ 1 
to be injective. This is ruled out by the fact that belongs to the resolvent set of L. We can 
therefore define (—L) a as the unbounded operator with domain V((— L) a ) = range(— L)~ a given 
by the inverse of (— L)~ Q . This definition is again consistent with the usual definition of (— L) a 
for integer values of a. This allows us to set: 

Definition 4.27 For a > and given an analytic semigroup S on a Banach space B, we define 
the interpolation space B a as the domain of (— L) a endowed with the norm ||x|| a = ||(— L) a a;||. 
We similarly define B- a as the completion of B for the norm ||x||_ a = ||(— L)~ a x\\. 

Remark 4.28 If the norm of S(t) grows instead of decaying with t, then we use A — L instead of 
— L for some A sufficiently large. The choice of different values of A leads to equivalent norms on 

B a . 

Exercise 4.29 Show that the inclusion B a C Bp for a > (3 hold, whatever the signs of a and (3. 
Exercise 4.30 Show that for a G (0, 1) and x £ V(L), one has the identity 

(-L) a x = f° t a -\t - L)-\-L)x dt . (4.9) 

vr Jo 

Hint: Write the resolvent appearing in (4.9) in terms of the semigroup and apply the resulting 
expression to (—L)~ a x, as defined in (4.8). The aim of the game is then to perform a smart 
change of variables. 

Exercise 4.31 Use (4.9) to show that, for every a 6 (0, 1), there exists a constant C such that the 
bound \\(-L) a x\\ < C7||Lx|| Q? ||a;|| 1 " a holds for every x G V(L). 

Hint: Split the integral as J °° = J K + f£? and optimise over K. (The optimal value for K will 
turn out to be proportional to ||Lx||/||x||.) In the first integral, the identity (t — L) _1 (— L) = 
1 — t(t — Ly 1 might come in handy. 

Exercise 4.32 Let L be the generator of an analytic semigroup on B and denote by B a the corre- 
sponding interpolation spaces. Let B be a (possibly unbounded) operator on B. Using the results 
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from the previous exercise, show that if there exists a G [0, 1) such that B a C V{B) so that B is 
a bounded operator from B a to B, then one has the bound 



|Bx|| < C{e\\Lx\\+s- a/{1 - a) \ 



x 



for some constant C > and for all e < 1. In particular, L + B is also the generator of an analytic 
semigroup on B. 

Hint: The assumption on B implies that there exists a constant C such that ||Bx|| < C||ar|| Q . 

Exercise 4.33 Let L and B be as in Exercise 4.32 and denote by Sb the analytic semigroup with 
generator L + B. Use the relation R\ — R\ = R\BR\ to show that one has the identity 



S B (t)x = S(t)x + /* S(t - s)BS B (s)xds . 
Jo 



Hint: Start from the right hand side of the equation and use an argument similar to that of the 
proof of Theorem 4.22. 

Exercise 4.34 Show that (—L) a commutes with S(t) for every t > and every a G R. Deduce 
that S(t) leaves B a invariant for every a > 0. 

Exercise 4.35 It follows from Theorem 4.22 that the restriction of the adjoint L* of the gener- 
ator of an analytic semigroup on B to the 'semigroup dual' space B^ is again the generator of an 
analytic semigroup on B^. Denote by B\ the corresponding interpolation spaces. Show that one 

has fit = V{(-L^) a ) C V({(-L) a )*) = (B^ a )* for every a > 0. 

We now show that an analytic semigroup S(t) always maps B into B a for t > 0, so that it has a 
'smoothing effect'. Furthermore, the norm in the domains of integer powers of L can be bounded 
by: 

Proposition 4.36 For every t > and every integer k > 0, S(t) maps B into V{L k ) and there 
exists a constant such that 



\L k S(t)x\\ < ^ 



for every t 6 (0, 1]. 



Proof. In order to show that S maps B into the domain of every power of L, we use (4.6), together 
with the identity LR\ = \R\ — 1 which is an immediate consequence of the definition of the 
resolvent R\ of L. Since b e tz dz = for every t such that | &igt\ < ip and since the domain 

of L k is complete under the graph norm, this shows that S(t)x G V{L k ) and 

L k S{t) = — I z k e tz R z dz . 

It follows that there exist positive constants Cj such that 

\\L k S(t)\\ < — / \z\ k \e tz \\\R z \\ d\z\ < a / (1 + x) k e~ C2t{x - OA \l + x)~ 1 dx . 
27T J lif h Jo 

Integrating by parts k — 1 times, we obtain 

\\L k S m <^ T fe-^^dx= C -^, 
which implies the announced bound. □ 
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It turns out that a similar bound also holds for interpolation spaces with non-integer indices: 

Proposition 4.37 For every t > and every a > 0, S(t) maps B into B a and there exists a 
constant C a such that 

\\(-L) a S(t)x\\ < ^ (4.10) 

for every t G (0, 1]. 

Proof. The fact that S(t) maps B into B a follows from Proposition 4.36 since there exists n such 
that V{L n ) C B a . We assume again that the norm of S(t) decays exponentially for large t. The 
claim for integer values of a is known to hold by Proposition 4.36, so we fix some a > which is 
not an integer. Note first that (— L) a = (— L) a ~ M_1 (— L) [a]+1 , were we denote by [a] the integer 
part of a. We thus obtain from (4.8) the identity 

(—"nM+i roc 
(-L) a S(t) = -i-j — / s M ~ a L^ +1 S(t + s)ds. 
T([q] - a + 1) Jo 

Using the previous bound for k = [a], we thus get for some C > the bound 

roo e —w(t+s) roc Ja]—a 

\\(-L) a S(t)\\ < C / s [a] ~ a j— r— r- ds < Ct- a / r — T ds , 

" Jo (t + s)M +1 ~ Jo (l + s)M +1 

where we used the substitution s ts. Since the last function is integrable for every a > 0, the 
claim follows at once. □ 



Exercise 4.38 Using the fact that S(t) commutes with any power of its generator, show that S(t) 
maps B a into Bp for every a, (3 G R and that, for (3 > a, there exists a constant C Q)( g such that 

\\S(jt)x\\ Bp < C a ,f3\\x\\ Ba t a -P for all t G (0, 1]. 

Exercise 4.39 Using the bound from the previous exercise and the definition of the resolvent, 
show that for every a G R and every [3 G [a, a + 1) there exists a constant C such that the bound 

\\{t - L)~ l x\\ Bfj < C(l + ^~ a-1 ||x||B a holds for all t > 0. 

Exercise 4.40 Consider an analytic semigroup S(t) on B and denote by B a the corresponding 
interpolation spaces. Fix some 7 G R and denote by S(t) the semigroup S viewed as a semigroup 
on B 1 . Denoting by B a the interpolation spaces corresponding to S(t), show that one has the 
identity B a = B 1+a for every a G R. 

Another question that can be answered in a satisfactory way with the help of interpolation 
spaces is the speed of convergence of S(t)x to x as t — > 0. We know that if x G V(L), then 
1 1 — ^ S(t)x is differentiable at t = 0, so that || £(i)a; — x\\ = t\\Lx\\ + o(t). Furthermore, one can in 
general find elements x G B so that the convergence S(t)x —> x is arbitrarily slow. This suggests 
that if a; G V((-L) a ) for a G (0, 1), one has \\S(t)x - x\\ = 0(t a ). This is indeed the case: 

Proposition 4.41 Let S be an analytic semigroup with generator L on a Banach space B. Then, 
for every a G (0, 1), there exists a constant C a , so that the bound 

\\S(t)x-x\\ <C a t a \\x\\ Ba (4.11) 

holds for every x G B a and every t G (0,1]. 
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Proof. By density, it is sufficient to show that (4.11) holds for every x G D(L). For such an x, one 
has indeed the chain of inequalities 



\\S(t)x-x\ 



f S(s)Lxdx = f\-L) 1 - a S(s)(-L) a xdx 
Jo Jo 



<C\\x\\ Ba [ t \\(-L) 1 - a S(s)\\dx<C\\x\\ Ba [* s"- 1 ds = C\\x\\ B J a 
Jo Jo 

Here, the constant C depends only on a and changes from one expression to the next. 



□ 



We conclude this section with a discussion on the interpolation spaces arising from a perturbed 
analytic semigroup. As a consequence of Exercises 4.31, 4.32, and 4.39, we have the following 
result: 

Proposition 4.42 Let Lq be the generator of an analytic semigroup on B and denote by B® the 
corresponding interpolation spaces. Let B be a bounded operator from B\ to B for some a G 
[0, 1). Let furthermore B y be the interpolation spaces associated to L = Lq + B. Then, one has 
B y = B® for every 7 G [0,1]. 

Proof. The statement is clear for 7 = and 7 = 1. For intermediate values of 7, we will show 
that there exists a constant C such that C _1 ||(— Lo) 7 x|| < ||(— L) 7 x|| < C\\(— Lo) 7 x|| for every 
x G V(L ). 

Since the domain of L is equal to the domain of Lq, we know that the operator BR t is bounded 
for every t > 0, where R t is the resolvent of L. Making use of the identity 



Rt — R® + R^BRt , 



(4.12) 



(where we similarly denoted by R® the resolvent of Lq) it then follows from Exercise 4.39 and the 
assumption on B that one has for every x G B® the bound 

\\BR t x\\ < \\BR%x\\ + \\BR%BR t x\\ < C{\\R^x\\ B o + \\R$BR t x\\ B o) 
< C(l + t^^WxWso + C{1 + t) a - l \\BR t x\\ . 



It follows that, for t sufficiently large, one has the bound 

\\BR t x\\ < Ca + t) 0-7-1 ! 



x B o 
"7 



(4.13) 



(Note that this bound is also valid for 7 = 0.) Since one furthermore has the resolvent identity 
R s = Rt + (t — s)R s Rt, this bound can be extended to alH > by possibly changing the value 
of the constant C. 

We now show that ||(— L) 7 x|| can be bounded by ||(— Lo) 7 x||. We make use of Exercise 4.31 
to get, for x G V{Lq), the bound 



MK = c 



< c 



/ f-^LRtxdt 
Jo 

/ f^LoB^xdt +C t 7_1 ||(L -R? + l)BR t x \\dt 
Jo Jo 



roo 

< \\x\\ B o +C / t^WBRtxWdt 

1 Jo 

roo 

< \\x\\ B o + C / t^il + i) a_7_1 dt\\x\\ B o . 

1 Jo 1 
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Here, we used again the identity (4.12) to obtain the first inequality and we used (4.13) in the last 
step. Since this integral converges, we have obtained the required bound. 
In order to obtain the converse bound, we have similarly to before 

PCX) 

\\x\\b° < IMk + C / t^WBRtxW dt . 
1 1 Jo 

Making use of the resolvent identity, this yields for arbitrary K > the bound 

/*oo /*oo 

IM| B o < + C / P^WBRt+KxW dt + CK / f-^WBRt+KRtxW dt 

1 Jo Jo 

< IMk + c / f-\t + if) a- T -1 dt\\x\\ B o + ck / + ty 1 dt\\x\\ 

Jo 1 Jo 

< \\x\\By + OK -0-1 ||x|| B o + CK\\x\\ . 

By making K sufficiently large, the prefactor of the second term can be made smaller than |, say, 
so that the required bound follows by the usual trick of moving the term proportional to ||x||go to 
the left hand side of the inequality. □ 



Exercise 4.43 Assume that B = TC is a Hilbert space and that the antisymmetric part of L is 
'small' in the sense that V(L*) = V{L) and, for every e > there exists a constant C such that 
|| (L — L*)x\\ < s\\Lx\\ + C||x|| for every x G V(L). Show that in this case the space H- a can be 
identified with the dual of H a (under the pairing given by the scalar product of Ti) for a G [0, 1]. 

It is interesting to note that the range [0, 1] appearing in the statement of Proposition 4.42 
is not just a restriction of the technique of proof employed here. There are indeed examples 
of perturbations of generators of analytic semigroups of the type considered here which induce 
changes in the corresponding interpolation spaces B a for a [0, 1]. 

Consider for example the case B = L 2 ([0, 1]) and Lq = A, the Laplacian with periodic 
boundary conditions. Denote by the corresponding interpolation spaces. Let now 5 € (0, 1) be 
some arbitrary index and let g 6 B be such that g B®. Such an element g exists since A is an 
unbounded operator. Define B as the operator with domain C 1 ([0, 1]) C B given by 

(Bf)(x) = f'(l/2)g(x) . (4.14) 

It turns out that B\ C C 1 ([0, 1]) for a > 3/4 (see for example Lemma 6.13 below), so that the 
assumptions of Proposition 4.42 are indeed satisfied. Consider now the interpolation spaces of 
index 1 + 5. Since we know that B$ = B®, we have the characterisations 

Bi+s = {/ € P(A) : A/ + /' (l/2) 5 G B° s } , 
B° 1+s = {/ € 2>(A) : A/ G B° s } . 

Since on the other hand g g" B$ by assumption, it follows that B\ + $ n B® , s consists precisely of 
those functions in V(A) that have a vanishing derivative at 1/2. In particular, B\ + s / B® +s . 

One can also show that Z3_i/4 / B°_ x ^ in the following way. Let {f n } C V(L) be an arbitrary 
sequence of elements that form a Cauchy sequence in B 3 u. Since we have already shown that 
£>3/ 4 = B® u, this implies that {f n } is Cauchy in B® # 4 as well. It then follows from the definition 
of the interpolation spaces that the sequence {A/ n } is Cauchy in B_ 1 , 4 and that the sequence 
{(A + B)f n } is Cauchy in B_ 1 / i . Assume now by contradiction that B_ 1 / i = B Q _ x j^. 
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This would entail that both {Af n } and {Af n + Bf n } are Cauchy in B-iu, so that {f^(l/2)g} 
is Cauchy in 6-1/4. This in turn immediately implies that the sequence {f' n {\/2)} must be Cauchy 
in R. Define now f n by 

™ sin(4vrfcx) 

It is then straightforward to check that, since X)fc(^l°g 2 ^) 1 converges, this sequence is Cauchy 
in By 4 . On the other hand, we have f n (l/2) = J2k=i(k l°g which diverges, thus leading to 
the required contradiction. 

Exercise 4.44 Show, again in the same setting as above, that if g G B$ for some 5 > 0, then one 

has B a = B° a for every a € [0, 1 + 5). 

Remark 4.45 The operator B defined in (4. 14) is not a closed operator on B. In fact, it is not even 
closable! This is however of no consequence for Proposition 4.42 since the operator L = Lq + B 
is closed and this is all that matters. 



5 Linear SPDEs / Stochastic Convolutions 

We now apply the knowledge gathered in the previous sections to discuss the solution to linear 
stochastic PDEs. Most of the material from this section can also be found in one way or the other 
in the monographs [DPZ92b, DPZ96] by Da Prato and Zabczyk. The aim of this section is to 
define what we mean by the solution to a linear stochastic PDE of the form 

dx = Lxdt + Q dW(t) , x(0) = x , (5. 1) 

where we want x to take values in a separable Banach space B, L is the generator of a Co semigroup 
on B, W is a cylindrical Wiener process on some Hilbert space /C, and Q: K. — > B is a bounded 
linear operator. 

We do not in general expect x to take values in V(L) and we do not even in general expect 
QW(t) to be a B- valued Wiener process, so that the usual way of defining solutions to (5.1) by 
simply integrating both sides of the identity does not work. However, if we apply some £ 6 V(L*) 
to both sides of (5.1), then there is much more hope that the usual definition makes sense. This 
motivates the following definition: 

Definition 5.1 A B- valued process x(t) is said to be a weak solution to (5.1) if, for every t > 0, 
Jq ||x(s)|| ds < 00 almost surely and the identity 

(£, x(t)} = (£, x ) + [\l*£, x(s)) ds + [\q*£, dW(s)} , (5.2) 

JO JO 

holds almost surely for every £ £ V(L*). 

Remark 5.2 (Very important!) The term 'weak' refers to the PDE notion of a weak solution 
and not to the probabilistic notion of a weak solution to a stochastic differential equation. From a 
probabilistic point of view, we are always going to be dealing with strong solutions in these notes, 
in the sense that (5.1) can be solved pathwise for almost every realisation of the cylindrical Wiener 
process W. 

Just as in the case of stochastic ordinary differential equations, there are examples of stochastic 
PDEs that are sufficiently irregular so that they can only be solved in the probabilistic weak sense. 
We will however not consider any such example in these notes. 
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Remark 5.3 The stochastic integral in (5.2) can be interpreted in the sense of Section 3.4 since 
the map Q*£: K. -»■ R is Hilbert-Schmidt for every £ £ B*. 

Remark 5.4 Although separability of B was not required in the previous section on semigroup 
theory, it is again needed in this section, since many of the results from the section on Gaussian 
measure theory would not hold otherwise. 

On the other hand, suppose that /:R+ — > V(L) is a continuous function and consider the 
function x: R + — ► T>(L) given by x(t) = S(t)xo + Jq S(t — s)f(s) ds, where S is the Co-semigroup 
generated by L. If xo G T>(L) as well, then this function is differentiable and it is easy to check, 
using Proposition 4.7, that it satisfies the differential equation dtx = Lx + f. Formally replacing 
f(s) ds by Q dW(s), this suggests the following alternative definition of a solution to (5.1): 

Definition 5.5 A B-valued process x(t) is said to be a mild solution to (5.1) if the identity 

x(t) = S(f)x + /* S(t - s)Q dW(s) , (5.3) 
J o 

holds almost surely for every t > 0. The right hand side of (5.3) is also sometimes called a 
stochastic convolution. 



Remark 5.6 By the results from Section 3.4, the right hand side of (5.3) makes sense in any 
Hilbert space H containing B and such that J* * tr iS(t—s)QQ* S(t—s)* i* ds < oo, where i:B — > H 
is the inclusion map. The statement should then be interpreted as saying that the right hand side 
belongs toBcH almost surely. In the case where B is itself a Hilbert space, (5.3) makes sense if 
and only if j* * tr S(t - s)QQ*S(t - s)* ds < oo. 

It turns out that these two notions of solutions are actually equivalent: 

Proposition 5.7 If the mild solution is almost surely integrable, then it is also a weak solution. 
Conversely, every weak solution is a mild solution. 

Proof. Note first that, by considering the process x(t) — S(t)xo and using Proposition 4.7, we can 
assume without loss of generality that xq = 0. 

We now assume that the process x(t) defined by (5.3) takes values in B almost surely and we 
show that this implies that it satisfies (5.2). Fixing an arbitrary i E THjJ), applying L*£ to both 
sides of (5.3), and integrating the result between and t, we obtain: 

J\l*£, x(s)} ds = J* J\l*£, S{s - r)Q dW{r)) ds = J* (J* S*{s - r)L*£ds, Q dW{r)) . 

Using Proposition 4.7 and the fact that, by Proposition 4.15, S* is a strongly continuous semigroup 
on B\ the closure of V(L*) in B* , we obtain 

[\l*£, x(s)} ds = f\s*(t - r)£, Q dW(r)} - f (£, Q dW(r)} 
Jo Jo Jo 

= (e, ^ S(t - r)Q dW(r)) - J {£,Q dW(r)) 

= (£,x{t)) - [\e,QdW(r)) , 
Jo 
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thus showing that (5.2) holds for every £ G T>(jJ). To show that x is indeed a weak solution to 
(5.1), we have to extend this to every £ G V(L*). This however follows immediately from the fact 
that fit is weak-* dense in fi*, which was the content of Proposition 4.17. 

To show the converse, let now x(t) be any weak solution to (5.1) (again with xo = 0). Fix an 
arbitrary £ G V(]J), some final time t > 0, and consider the function f(s) = S*{t — s)£. Since 
£ G V(p), it follows from Proposition 4.7 that this function belongs to £ = C([0, t], V0)) n 
C 1 ([0, <],/?'). We are going to show that one has for such functions the almost sure identity 

{f{t), x(t)} = f\f(s) + L*f(s), x(s)} ds + f\f(s), Q dW(s)} . (5.4) 

JO J 

Since in our case f(s) + L*f(s) = 0, this implies that the identity 

(£, x(t)) = f\i, S(t - s)Q dW(s)} , (5.5) 
Jo 

holds almost surely for all £ G V{l)). By the closed graph theorem, fit is large enough to separate 
points in fi. 2 Since T>(L^) is dense in fit and since fi is separable, it follows that countably many 
elements of T>(U) are already sufficient to separate points in fi. This then immediately implies 
from (5.5) that x is indeed a mild solution. 

It remains to show that (5.4) holds for all / G £. Since linear combinations of functions of 
the type ip e (s) = £<p(s) for ip G C\[0,t],R) and £ G V0) are dense in £ (see Exercise 5.9 
below) and since x is almost surely integrable, it suffices to show that (5.4) holds for / = ip£. 
Since (£, QW{s)) is a standard one-dimensional Brownian motion, we can apply Ito's formula to 
ip(s)(£, x(s)}, yielding 

tp(t){£,x(t)) = (\{s){L*t,x{s))+ [* <p(s)(£,x(s)) + f <p(s)(£,QdW(s)) , 
Jo Jo Jo 

which coincides with (5.4) as required. □ 

Remark 5.8 It is actually possible to show that if the right hand side of (5.3) makes sense for some 
t, then it makes sense for all t and the resulting process belongs almost surely to L p ([0, T], fi) for 
every p. Therefore, the concepts of mild and weak solution actually always coincide. This follows 
from the fact that the covariance of x(t) increases with t (which is a concept that can easily be 
made sense of in Banach spaces as well as Hilbert spaces), see for example [DJT95]. 

Exercise 5.9 Consider the setting of the proof of Proposition 5.7. Let / G £ = C([0, 1],V0)) n 
C 1 ([0, l],fit) and, for n > 0, define /„ on the interval s G [k/n,(k + l)/n] by cubic spline 
interpolation: 

f n ( s ) = f(k/n)(k + 1 - ns) 2 (l + 2ns - 2k) + f((k + l)/n)(ns - kf(3 - 2ns + 2k) 
+ (ns - k)(k + 1 - ns) 2 n{f((k + ±)/n) - f((k - \)/n)) 
+ (ns - kf(ns -k- l)n(f((k + |)/n) - f((k + \)/n)) . 

Show that /„ is a finite linear combinations of functions of the form £ip(s) with ip G C 1 ([0, 1], R) 
and that /„ -» / in C([0, l],V(tf)) n C\[0, 1], fit). 

2 Assume that, for some x, y G B, we have (£, x) = {£, y) for every i € T>(L*). We can also assume without loss 
of generality that the range of L is £>, so that x = Lx' and y = Ly', thus yielding (L*£, x') — (L*£, y'). Since L is 
injective and has dense domain, the closed graph theorem states that the range of L* is all of B* , so that x' = y' and 
thus also x = y. 
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5.1 Time and space regularity 

In this subsection, we are going to study the space and time regularity of solutions to linear stochas- 
tic PDEs. For example, we are going to see how one can easily derive the fact that the solutions to 
the stochastic heat equation are 'almost' ^-Holder continuous in time and 'almost' ^-Holder con- 
tinuous in space. Since we are often going to use the Hilbert-Schmidt norm of a linear operator, 
we introduce the notation 

\\A\g s =tr AA* . 

For most of this section, we are going to make use of the theory of analytic semigroups. How- 
ever, we start with a very weak regularity result for the solutions to stochastic PDEs whose linear 
operator L generates an arbitrary Co-semigroup: 

Theorem 5.10 Let H and K, be separable Hilbert spaces, let L be the generator of a Co-semigroup 
on 71, let Q:K, — > TC be a bounded operator and let W be a cylindrical Wiener process on /C. 
Assume furthermore that HS^QHhs < oo for every t > and that there exists a £ (0, ^) such 
that Jq 1 i _2 ° ! ||5'(i)Q||Hs dt < oo. Then the solution x to (5.1) has almost surely continuous sample 
paths in TL. 

Proof. Note first that 1 1 S(t + s)Q 1 1 H s < 1 1 S(s) \ \ \ \ S(t)Q || HS , so that the assumptions of the theorem 
imply that J Q T t~ 2a \\S(t)Q\\n S dt < oo for every T > 0. Let us fix an arbitrary terminal time T 
from now on. Defining the process y by 

y(t) = f\t - sy a S(t - s)Q dW(s) , 
Jo 

we obtain the existence of a constant C such that 

E||y(t)|| 2 = [\t -s)~ 2a \\S(t-s)Q\\ 2 us ds= f s-^WS^QWlsdsKC , 
Jo Jo 

uniformly for t 6 [0, T]. It therefore follows from Fernique's theorem that for every p > there 
exist a constant C p such that 

E [ T \\y(t)\\ P dt <C P . (5.6) 
Jo 

Note now that there exists a constant c a (actually c a = (sin 2nd) / ir) such that the identity 

[\t-r) a -\r-s)- a dr= — , 

J s Ca 

holds for every t > s. It follows that one has the identity 

x(t) = S(t)x + c a J* J\t - r) a ~\r - S y a S(t -s)drQ dW(s) 

= S{t)x + c a f f\t - r) a -\r - S y a S(t - s)Q dW(s) dr 
Jo Jo 

= S(t)x + c a [ S(t - r) [\r - S y a S(r - s)Q dW(s) (t - r) a ~ l dr 
Jo Jo 

= S(t)x + c a [ S(t - r)y{r) (t - r)"" 1 dr . (5.7) 
Jo 
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The claim thus follows from (5.6) if we can show that for every a 6 (0, 5) there exists p > such 
that the map 

y^F y , F y (t) = f\t - rT-'Sit - r)y(r) dr 
Jo 

maps L p ([0, T],H) into C([0, T], Ti). Since the semigroup t S(t) is uniformly bounded (in the 
usual operator norm) on any bounded time interval and since t w (( - r) a_1 belongs to L q for 
q € [1, 1/(1— a)), we deduce from Holder's inequality that there exists a constant Ct such that one 
does indeed have the bound sup t6[0 T] ||-Fj/(i)|| p < Ct J \\y(t)\\ p dt, provided thatp > ^. Since 
continuous functions are dense in LP, the proof is complete if we can show that F y is continuous 
for every continuous function y with y(0) = 0. 

Fixing such a y, we first show that F y is right-continuous and then that it is left continuous. 
Fixing t > 0, we have for h > the bound 

\\F y (t + h)- F y (t)\\ < A((t + h- rT- l S{h) - (t - rr-^SQ - r)y(r)\\ dr 
Jo 

rt+h 

+ j (t + h-rr^WSit + h-rMr^dr 

The second term is bounded by 0{h 5 ) for some 5 > by Holder's inequality. It follows from 
the strong continuity of S that the integrand of the first term converges to pointwise as h — ► 0. 
Since on the other hand the integrand is bounded by C(t — r) a_1 ||y(r)|| for some constant C, 
this term also converges to by the dominated convergence theorem. This shows that F y is right 
continuous. 

To show that F y is also left continuous, we write 

\\F y (t) - F y (t - h)\\ < f h \\iSt - r) a - l S{h) -{t-h- r) a - l )S{t -h- r)y(r)\\ dr 
Jo 

+ [ {t-rT- l \\S{t-r)y{r)\\dr . 
Jt-h 

We bound the second term by Holder's inequality as before. The second term can be rewritten as 

/*||((t + h- r) a - l S{h) - (t - rr-^Sit - r)y(r - h)\\ dr , 
Jo 

with the understanding that y(r) = for r < 0. Since we assumed that y is continuous, we can 
again use the dominated convergence theorem to show that this term tends to as h — ► 0. □ 

Remark 5.11 The trick employed in (5.7) is sometimes called the "factorisation method" and 
was introduced in the context of stochastic convolutions by Da Prato, Kwapieri, and Zabczyk 
[DPKZ87, DPZ92a]. 

This theorem is quite sharp in the sense that, without any further assumption on Q and L, it 
is not possible in general to deduce that t x(t) has more regularity than just continuity, even if 
we start with a very regular initial condition, say xo = 0. We illustrate this fact with the following 
exercise: 

Exercise 5.12 Consider the case H = L 2 (R), K. = R, L = d x and Q = g for some g G L 2 (R) 
such that g > and g(x) = \x\~@ for some (3 G (0, |) and all \x\ < 1. This satisfies the conditions 
of Theorem 5.10 for any a < 1. 
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Since L generates the translation group, the solution to 

du(x, t) = d x u(x, t) dt + g(x) dW(t) , u(x, 0) = , 

is given by 

u(x, t) = / g(x + t - s) dW{s) . 
Jo 

Convince yourself that for fixed t, the map x i— > u(x, t) is in general 7-Holder continuous for 
7 < ^ — (3, but no better. Deduce from this that the map t u(-,t) is in general also 7-Holder 
continuous for 7 < \ — [3 (if we consider it either as an H-valued map or as a Cf,(R)-valued map), 
but cannot be expected to have more regularity than that. Since (3 can be chosen arbitrarily close 
to \, it follows that the exponent a appearing in Theorem 5.10 is in general independent of the 
Holder regularity of the solution. 

One of the main insights of regularity theory for parabolic PDEs (both deterministic and 
stochastic) is that space regularity is intimately linked to time regularity in several ways. Very 
often, the knowledge that a solution has a certain spatial regularity for fixed time implies that it 
also has a certain temporal regularity at a given spatial location. 

From a slightly different point of view, if we consider time-regularity of the solution to a PDE 
viewed as an evolution equation in some infinite-dimensional space of functions, then the amount 
of regularity that one obtains depends on the functional space under consideration. As a general 
rule, the smaller the space (and therefore the more spatial regularity it imposes) the lower the 
regularity of the solution, viewed as a function with values in that space. 

We start by giving a general result that tells us precisely in which interpolation space one can 
expect to find the solution to a linear SPDE associated with an analytic semigroup. This provides 
us with the optimal spatial regularity for a given SPDE: 

Theorem 5.13 Consider (5.1) on a HUbert space ri, assume that L generates an analytic semi- 
group, and denote by TL a the corresponding interpolation spaces. If there exists a > such that 
Q: K, — ► H a is bounded and (3 G (0, \ + a] such that ||(— L) _ ^||hs < °o then the solution x takes 
values in Hjfor every 7<7o = ^ + a — (3. 

Proof. As usual, we can assume without loss of generality that belongs to the resolvent set of L. 
It suffices to show that 

I(T) ^ f T \\(-LyS(t)Q\\l s dt < 00 , VT > . 

Since Q is assumed to be bounded from JC to H a , there exists a constant C such that 

KT) <C [ T \\(-LyS(t)(-Lr a \\ 2 us dt = C [ T \\(-Lr~ a S(t)\\ 2 m dt . 
Jo Jo 

Since (— L) _/3 is Hilbert-Schmidt, we have the bound 

\\(-Ly- a S(t)\\ m < \\(-L)-P\\ HS \\(-Lf^- a S(t)\\ < c(ivr^) . 

For this expression to be square integrable near t = 0, we need a — 7 — (3 > — \, which is precisely 
the stated condition. □ 

Exercise 5.14 Show that if we are in the setting of Theorem 5.13 and L is selfadjoint, then the 
solutions to (5.1) actually belong to H 7 for 7 = 70. 
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Exercise 5.15 Show that the solution to the stochastic heat equation on [0, 1] with periodic bound- 
ary conditions has solutions in the fractional Sobolev space H 8 for every s < 1/2. Recall that H s 
is the Hilbert space with scalar product (f,g) s = J2k fkdkO- + k 2 ) s , where f k denotes the kth 
Fourier coefficient of /. 

Exercise 5.16 Consider the following modified stochastic heat equation on [0, l] d with periodic 
boundary conditions: 

dx = Axdt + {1 - Ay^dW , 

where W is a cylindrical Wiener process on L 2 ([0, For any given s > 0, how large does 7 
need to be for x to take values in H s ? 

Using this knowledge about the spatial regularity of solutions, we can now turn to the time- 
regularity. We have: 

Theorem 5.17 Consider the same setting as in Theorem 5.13 and fix 7 < 70. Then, at all times 
t > 0, the process x is almost surely 5-Hdlder continuous in Jiyfor every 5 < \ A (70 — 7). 

Proof. It follows from Kolmogorov's continuity criteria, Proposition 3.18, that it suffices to check 
that the bound 

E||x(t) - x(s)|| 2 < C\t - s\ 1A2( ^ } 

holds uniformly in s, t G [to, T] for every to, T > and for every 7 < 70- Here and below, C is 
an unspecified constant that changes from expression to expression. Assume that t > s from now 
on. It follows from the semigroup property and the independence of the increments of W that the 
identity 

x(t) = S(t - s)x(s) + J S(t- r)Q dW(r) , (5.8) 

holds almost surely, where the two terms in the sum are independent. This property is also called 
the Markov property. Loosely speaking, it states that the future of x depends on its present, but 
not on its past. This transpires in (5.8) through the fact that the right hand side depends on x(s) 
and on the increments of W between times s and t, but it does not depend on x(r) for any r < s. 

Furthermore, x(s) is independent of the increments of W over the interval [s,t], so that Propo- 
sition 4.41 allows us to get the bound 

E||s(t) - x(s)\\ 2 = E\\S(t - s)x(s) - x(s)\\ 2 + J* S \\(-LrS(r)Q\\ 2 HS dr 

< C\t - s| 2( ^- 7)A2 E||x(s)||? + C f \l V r a -~<-Pf dr . 

Jo 

Here, we obtained the bound on the second term in exactly the same way as in the proof of 
Theorem 5.13. The claim now follows from the fact that a — 7 — j3 = (70 — 7) — \. □ 

5.2 Long-time behaviour 

This section is devoted to the behaviour of the solutions to (5.1) for large times. Let's again start 
with an example that illustrates some of the possible behaviours. 

Example 5.18 Let x i-> V(x) be some smooth 'potential' and let H = L 2 (R, exp(-V(x)) dx). 
Let S denote the translation semigroup (to the right) on Ti and denote its generator by — d x . Let us 
first discuss which conditions on V ensure that S is a strongly continuous semigroup on H. It is 
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clear that it is a semigroup and that S(t)u — > u for u any smooth function with compact support. 
It therefore only remains to show that H-S^H is uniformly bounded for t G [0, 1] say. We have 

\\S(t)u\\ 2 = J u\x - t)e- V{x) dx = J u\x)e- V{x) e V{x) - V{x+t) dx . (5.9) 

This shows that a necessary and sufficient condition for S to be a strongly continuous semigroup 
on H is that, for every t > 0, there exists Ct such that sup xgR (V(a;) — V(x + t)) < Ct and 
such that Ct remains bounded as t — > 0. Examples of potentials leading to a Co-semigroup are x, 
\/l + x 2 , log(l + x 2 ), etc or any increasing function. Note however that the potential V(x) = x 2 
does not lead to a strongly continuous semigroup. One different way of interpreting this is to 
consider the unitary transformation K:u^ exp(i V)u from the 'flat' space L 2 into 7i. Under this 
transformation, the generator — d x is turned into 

-(K~ l d x Ku)(x) = -d x u(x) - \V{x)u{x) . 

Considering the characterisation of generators of Co-semigroups given by the Hille-Yosida theo- 
rem, one would expect this to be the generator of a strongly continuous semigroup if V' is bounded 
from below, which is indeed a sufficient condition. 

Let now V be such that S is a Co-semigroup and consider the SPDE on H given by 

du(x, t) = -d x u(x, t) dt + f{x) dW(t) , (5.10) 

where W is a one-dimensional Wiener process and / is some function in H. The solution to (5.10) 
with initial condition no = is given as before by 

u(x,t)= f f(x + s-t)dW(s) . (5.11) 
J o 

If we fix the time t, we can make the change of variable s h-> t — s, so that u(x,t) is equal in 
distribution to J * f(x — s) dW(s). 

We see that if / happens to be also square integrable (we will assume that this is the case in the 
sequel and we will also assume that / is not identically zero), then (5.11) has a limit in distribution 
as t — > oo given by 

roo 

u(x) = f{x- s)dW(s) . (5.12) 
Jo 

It is however not clear a priori that u does belong to H. On one hand, we have the bound 

E / u{xfe~ V{x) dx = [ r f 2 {x-t)dte~ V{x) dx< [ f(t)dt [ e~ V(x) dx , 
Jr JrJo Jr Jr 

thus showing that u definitely belongs to H if e~ v has finite mass. On the other hand, there are 
examples where u G 7i even though e~ v has infinite mass. For example, if f(x) = for x < 0, 
then it is necessary and sufficient to have J °° e~ v<yX) dx < oo. Denote by v the law of u for further 
reference. 

Furthermore, if e~ v is integrable, there are many measures on H that are invariant under the 
action of the semigroup S. For example, given a function h G H which is periodic with period 
r (that is S(r)h = h), we can check that the push-forward of the Lebesgue measure on [0, r] 
under the map t S(t)h is invariant under the action of S. This is simply a consequence of the 
invariance of Lebesgue measure under the shift map. Given any invariant probability measure fih 
of this type, let v be an H-valued random variable with law /i/j that is independent of W. We can 
then consider the solution to (5.10) with initial condition v. Since the law of S(t)v is equal to the 
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law of v by construction, it follows that the law of the solution converges to the distribution of the 
random variable u + v, with the understanding that u and v are independent. 

This shows that in the case J e~ V(x) dx < oo, it is possible to construct solutions u to (5.10) 
such that the law of «(• , t) converges to fih * v for any periodic function h. 

Exercise 5.19 Construct an example of a potential V such that the semigroup S from the previous 
example is not strongly continuous by choosing it such that limt_»o ||£(*)|| = even though 
each of the operators S(t) for t > is bounded! Hint: Choose V of the form V(x) = x 3 — 
J2n>o nWi^f^), where W is an isolated 'spike' and c n are suitably chosen constants. 

This example shows that in general, the long-time behaviour of solutions to (5.1) may depend 
on the choice of initial condition. It also shows that depending on the behaviour of H, L and Q, 
the law of the solutions may or may not converge to a limiting distribution in the space in which 
solutions are considered. 

In order to formalise the concept of 'long-time behaviour of solutions' for (5. 1), it is convenient 
to introduce the Markov semigroup associated to (5.1). Given a linear SPDE with solutions in B, 
we can define a family Vt of bounded linear operators on B^B), the space of Borel measurable 
bounded functions from B to R by 

{Pt<p)(.x) = E<p(s(t)x + J* S(t - s)Q dW(s)) . (5.13) 

The operators Vt are Markov operators in the sense that the map A Vt^Aix) is a probability 
measure on B for every fixed x. In particular, one has Vtl = 1 and Vtf > if 92 > 0, that is 
the operators Vt preserve positivity. It follows furthermore from (5.8) and the independence of 
the increments of W over disjoint time intervals that Vt satisfies the semigroup property Vt+ S = 
Vt o V s for any two times s, t > 0. 

Exercise 5.20 Show that Vt maps the space Cb(B) of continuous bounded functions from B to R 
into itself. (Recall that we assumed B to be separable.) 

If we denote by Vt(x, ■ ) the law of S(t)x + j Q S(t — s)Q dW(s), then Vt can alternatively be 
represented as 

{Pt<p)(x)= f ip(y)V t (x,dy) . 
Jb 

It follows that its dual V\ acts on measures with finite total variation by 

(Ptn)(A)= f V t (x,A)ti(dx) . 

JB 

Since it preserves the mass of positive measures, VI is a continuous map from the space Pi(fi) of 
Borel probability measures on B (endowed with the total variation topology) into itself. It follows 
from (5.13) and the definition of the dual that V t fi is nothing but the law at time t of the solution 
to (5.1) with its initial condition no distributed according to u, independently of the increments of 
W over [0, t]. With these notations in place, we define: 

Definition 5.21 A Borel probability measure u on B is an invariant measure for (5.1) if V* \i = u 
for every t > 0, where Vt is the Markov semigroup associated to solutions of (5.1) via (5.13). 

In the case B = H where we consider (5.1) on a Hilbert space H, the situations in which such 
an invariant measure exists are characterised in the following theorem: 
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Theorem 5.22 Consider (5.1) with solutions in a Hilbert space TL and define the self-adjoint 
operator Qt-TL — > TL by 

Qt= [ S(t)QQ*S*(t)dt . 
Jo 

Then there exists an invariant measure p. for (5.1) if and only if one of the following two equivalent 
conditions are satisfied: 

1. There exists a positive definite trace class operator Qoo-TL — ► TL such that the identity 
2Re{QooL*x,x) + ||<9*x|| 2 = holds for every x £ V(L*). 

2. One has sup t>0 tr Q t < oo. 

Furthermore, any invariant measure is of the form v * p,^, where v is a measure on TL that is 
invariant under the action of the semigroup S and p,^ is the centred Gaussian measure with 
covariance Q^. 

Proof. The proof goes as follows. We first show that p being invariant implies that 2. holds. Then 
we show that 2. implies 1., and we conclude the first part by showing that 1. implies the existence 
of an invariant measure. 

Let us start by showing that if \i is an invariant measure for (5.1), then 2. is satisfied. By 
choosing ip(x) = e 1 ^'^ for arbitrary h €. TL, it follows from (5.13) that the Fourier transform of 
VtP satisfies the equation 

VTnix) = fi{S*{t)x)e-^ x ' Qtx) . (5.14) 

Taking logarithms and using the fact that \fi(x)\ < 1 for every x € TL and every probability 
measure p, It follows that if p is invariant, then 

(x,Q t x) < -2\og\p(x)\ , VxGTL, Vt>0. (5.15) 

Choose now a sufficiently large value of R > so that fi(\\x\\ > R) < 1/8 (say) and define a 
symmetric positive definite operator Ar: TL ^ TL by 

(h,A R h) = f \(x,h)\ 2 p(dx) . 

J\\x\\<R 

Since, for any orthonormal basis, one has ||x|| 2 = J2 n K x > e n}\ 2 , it follows that Ar is trace class 
and that tr Ar < R 2 . Furthermore, one has the bound 

|1 - Kh)\ < J \l - fi(dx) < J^ArT) + i . 

Combining this with (5.15), it follows that (x, Qtx) is bounded by 2 log 4 for every x € TL such 
that (x, Arx) < 1/4 so that, by homogeneity, 

(x,Q t x) < (8log4){x, Arx) . 

It follows that trQt < (81og4)i? 2 , so that 2. is satisfied. To show that 2. implies 1., note that 
sup tr Q t < oo implies that 

Qoo = / S(t)QQ*S*(t)dt, 
Jo 

1/2 

is a well-defined positive definite trace class operator (since t i-> Q t ' forms a Cauchy sequence 
in the space of Hilbert-Schmidt operators). Furthermore, one has the identity 

<ar,QooZ> = (S*(i)x,QooS*(t)x) + /* \\Q* S* (s)x\\ 2 ds . 

Jo 
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for x G V(L*), both terms on the right hand side of this expression are differentiable. Taking the 
derivative at t = 0, we get 

= 2Re(Q 00 L*x,x) + ||C?*a;|| 2 , 

which is precisely the identity in 1 . 

Let now Qoo be a given operator as in 1., we want to show that the centred Gaussian measure 
^oo with covariance Qoo is indeed invariant for Vt- For x G V{L*), it follows from Proposition 4.7 
that the map F x :t (Q 00 S*(t)x, S*(t)x) is differentiable with derivative given by dtF x (t) = 
2Re(Q 00 L*S*(t)x, S*(t)x). It follows that 

F x (t) - F x (0) = 2 f t Re{Q 00 L*S*{s)x,S*{s)x)ds = - f \\Q* S* {s)x\\ 2 ds , 
Jo Jo 

so that one has the identity 

Qoo = 5(t)Qoo5*(t) + /* S(s)QQ*S*(s)ds = SiVQooSW + Q t . 
Jo 

Inserting this into (5.14), the claim follows. Here, we used the fact that V(L*) is dense in H, 
which is always the case on a Hilbert space, see [Yos95, p. 196]. 

Since it is obvious from (5.14) that every measure of the type u * /^oo with u invariant for S 
is also invariant for Vt, it remains to show that the converse also holds. Let ^ be invariant for Vt 
and define ^ t as the push-forward of under the map S(t). Since (it(x) = fi(S*(t)x), it follows 
from (5.14) and the invariance of \i that there exists a function tp: TL — > R such that jlt(x) — ► ip(x) 
uniformly on bounded sets, tp o S(t)* = ip, and such that fi{x) = ip(x)exp(—^(x,Q 00 x}). It 
therefore only remains to show that there exists a probability measure v on H such that ijj = v. 

In order to show this, it suffices to show that the family of measures {fi t } is tight, that is for 
every e > there exists a compact set K such that fit(K) > 1 — e for every t. Prokhorov's theorem 
[Bil68, p. 37] then ensures the existence of a sequence t n increasing to oo and a measure v such 
that nt n — > v weakly. In particular, jlt n (x) — > v(x) for every x G H, thus concluding the proof. 

To show tightness, denote by u t the centred Gaussian measure on H with covariance Qt and 
note that one can find a sequence of bounded linear operators A n :H — ► H with the following 
properties: 

a. One has || A n+ ix|| > || Ai^H f° r every x G TL and every n > 0. 

b. The set Br = {x : sup n ||-A n x|| < R} is compact for every R > 0. 

c. One has sup^trAnQoo^* < oo. 

(By diagonalising Qoo, the construction of such a family of operators is similar to the construction, 
given a positive sequence {X n } with J2n < oo, of a positive sequence a n with lim n _,oo a n = 
+oo and J2 n °nA n < oo.) Let now e > be arbitrary. It follows from Prokhorov's theorem that 
there exists a compact set K C H such that fi(H \ K) < |. Furthermore, it follows from property 
c. above and the fact that C}oo > Qt that there exists R > such that v t {H \ Br) < |. Define a 
set K C H by 

K = {z-y : z £ K ,y £ B R } . 

It is straightforward to check, using the Heine-Borel theorem, that K is precompact. 

If we now take X and Y to be independent 7i-valued random variables with laws n t and u t 
respectively, then it follows from the definition of a mild solution and the invariance of \i that Z = 
X + Y has law /x. Since one has the obvious implication {Z G K}Sz{Y G Br} =>• {X G K}, it 
follows that 

Ht(H \ K) = P(X £K)< P(Z £ K) + P(Y $B R )<e, 
thus showing that the sequence {/j, t } is tight as requested. □ 
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It is clear from Theorem 5.22 that if (5.1) does have a solution in some Hilbert space TL and 
if ||<S(i)|| —> as t — > oo in that same Hilbert space, then it also possesses a unique invariant 
measure on TL. It turns out that as far as the "uniqueness" part of this statement is concerned, it is 
sufficient to have lim^oo ||£(i)a;|| = for every xeH: 

Proposition 5.23 Tflim^oo ||S , (t)x|| = for every x G TL, then (5.1) can have at most one 
invariant measure. Furthermore, if an invariant measure p,^ exists in this situation, then one has 
Vf v — > /^oo weakly for every probability measure v on TL. 

Proof. In view of Theorem 5.22, the first claim follows if we show that Sq is the only measure that 
is invariant under the action of the semigroup S. Let v be an arbitrary probability measure on TL 
such that S(t)*v = v for every t > and let ip: TL — > R be a bounded continuous function. On 
then has indeed 



where we first used the invariance of v and then the dominated convergence theorem. 

To show that V^v — > fi^ whenever an invariant measure exists we use the fact that in this 
case, by Theorem 5.22, one has Q t | Qoo in the trace class topology. Denoting by pt the centred 
Gaussian measure with covariance Q t , the fact that I? convergence implies weak convergence 
then implies that there exists a measure fi^ such that fi t —> Aoo weakly. Furthermore, the same 
reasoning as in (5.16) shows that S(t)*v — ► Sq weakly as t — > oo. The claim then follows from 
the fact that V\ v = (S(t)* v) * fit and from the fact that convolving two probability measures is a 
continuous operation in the topology of weak convergence. □ 

Note that the condition lim^oo = for every x is not sufficient in general to guaran- 

tee the existence of an invariant measure for (5.1). This can be seen again with the aid of Exam- 
ple 5.18. Take an increasing function V with lirn E _ +00 V(x) = oo, but such that J °° e~ V(x) dx = 
oo. Then, since exp(V(x) — V(x + t)) < 1 and lim^oo exp(V(x) — V(x + t)) = for every 
x G R, it follows from (5.9) and the dominated convergence theorem that lim^oo ||5(t)it|| = for 
every u £H. However, the fact that J °° e~ v ^ dx = oo prevents the random process u defined in 
(5.12) from belonging to H, so that (5.10) has no invariant measure in this particular situation. 

Exercise 5.24 Show that if (5.1) has an invariant measure fi^ but there exists x G TL such that 
lim sup t ^ oc > 0, then one cannot have Vlb x — ► fi^ weakly. In this sense, the statement 

of Proposition 5.23 is sharp. 

5.3 Convergence in other topologies 

Proposition 5.23 shows that if (5.1) has an invariant measure fi^, one can in many cases expect to 
have V\ v — > fi^ weakly for every initial measure v. It is however not clear a priori whether such 
a convergence also holds in some stronger topologies on the space of probability measures. If we 
consider the finite-dimensional case (that is TL = R n for some n > 0), the situation is clear: the 
condition lim^oo = for every x G TL then implies that lim^oo ||iS(£)|| = 0, so that L 

has to be a matrix whose eigenvalues all have strictly negative real parts. One then has: 

Proposition 5.25 In the finite-dimensional case, assume that all eigenvalues of L strictly negative 
real parts and that Qoo has full rank. Then, there exists T > such that VI 5 X has a smooth 
density p t ,x with respect to Lebesgue measure for every t > T. Furthermore, fi^ has a smooth 




(5.16) 
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density with respect to Lebesgue measure and there exists c > such that, for every A > 0, 
one has 

Urn e ct sup e x M\ Poo (y) - Pt,Av)\ = • 
j/eR n 

In other words, p t>x converges to p^ exponentially fast in any weighted norm with exponentially 
increasing weight. 

The proof of Proposition 5.25 is left as an exercise. It follows in a straightforward way from 
the explicit expression for the density of a Gaussian measure. 

In the infinite-dimensional case, the situation is much less straightforward. The reason is that 
there exists no natural reference measure (the equivalent of the Lebesgue measure) with respect to 
which one could form densities. 

In particular, even though one always has \\p t — /Uoo||oo —> in the finite-dimensional case 
(provided that Poo exists and that all eigenvalues of L have strictly negative real part), one cannot 
expect this to be true in general. Consider for example the SPDE 

dx = -xdt + Q dW(t) , x(t) £ H , 

where W is a cylindrical process on 7i and Q: H — > H is a Hilbert-Schmidt operator. One then 
has 

l-e~ 2t „ 1 

Qt = — 2 — ' Q°° = ~^QQ ■ 

Combining this with Proposition 3.40 (dilates of an infinite-dimensional Gaussian measure are 
mutually singular) shows that if QQ* has infinitely many non-zero eigenvalues, then ji t and p.^ 
are mutually singular in this case. 

One question that one may ask oneself is under which condi- 
tions the convergence V\ — > iioo takes place in the total variation 
norm. The total variation distance between two probability mea- 
sures is determined by their 'overlap' as depicted in the figure on 
the right: the total variation distance between [i and v is given by 
the dark gray area, which represents the parts that do not overlap. 
If p, and v have densities and V u with respect to some common 
reference measure it (one can always take tt = \{p + v)), then one 
has \\fi — z^|tv = / — V v (x)\ ir(dx). 

Exercise 5.26 Show that this definition of the total variation distance does not depend on the 
particular choice of a reference measure. 

The total variation distance between two probability measures /x and u on a separable Banach 
space B can alternatively be characterised as 

||M - v\\tv = 2 inf ir{{x + y}) , (5.17) 

where the infimum runs over the set C(p,v) of all probability measures it on B x B with marginals 
p, and v. (This set is also called the set of all couplings of \i and v.) In other words, if the total 
variation distance between fi and u is smaller than 2e, then it is possible to construct £?-valued 
random variables X and Y with respective laws p and v such that X = Y with probability larger 
than 1 — e. 

This yields a straightforward interpretation to the total variation convergence V% — ► Poz '■ f° r 
large times, a sample drawn from the invariant distribution is with high probability indistinguish- 
able from a sample drawn from the Markov process at time t. Compare this with the notion of 




Linear SPDEs / Stochastic Convolutions 



57 



weak convergence which relies on the topology of the underlying space and only asserts that the 
two samples are close with high probability in the sense determined by the topology in question. 
For example, \\S X — S y \\ is always equal to 2 if x ^ y, whereas 5 X — > 5 y weakly if x — > y. 

Exercise 5.27 Show that the two definitions of the total variation distance given above are indeed 
equivalent by constructing a coupling that realises the infimum in (5.17). It is useful for this to 
consider the measure \i A v which, in [i and v have densities and T> v with respect to some 
common reference measure tt, is given by (V^x) A T> u (x))ir(dx). 

An alternative characterisation of the total variation norm is as the dual norm to the supremum 
norm on the space Bf,(B) of bounded Borel measurable functions on £>: 

Wfj, — v\\tv = supl / (p(x)fj,(dx) — / ip(x)v(dx) : sup \<p(x)\ < 1 \ . 

It turns out that, instead of showing directly that V^v — > [i^ in the total variation norm, it is 
somewhat easier to show that one has V^v — ► ^ in a type of 'weighted total variation norm', 
which is slightly stronger than the usual total variation norm. Given a weight function V : B — ► R + , 
we define a weighted supremum norm on measurable functions by 

„ I, I 0*01 

xet3 1 + V(x) 

as well as the dual norm on measures by 

— ^||tv,v = sup| J (p(x)n(dx) — J (p(x)f(dx) : \\y>\\v < l} • (5.18) 

Since we assumed that V > 0, it is obvious that one has the relation — v\\tw < — ^HtVjVj 
so that convergence in the weighted norm immediately implies convergence in the usual total 
variation norm. By considering the Jordan decomposition of fi — v = q + — Q-, it is clear that the 
supremum in (5.18) is attained at functions ip such that ip(x) = 1 + V(x) for q + -almost every x 
and ip{x) = — 1 — V(x) for -almost every x. In other words, an alternative expression for the 
weighted total variation norm is given by 

II/" - v\\tv,v = [ (1 + V{x)) \fi - u\{dx) , (5.19) 

J X 

just like the total variation norm is given by ||/x — i/||tv = \n ~ v\(X). 

The reason why it turns out to be easier to work in a weighted norm is the following: For a 
suitable choice of V, we are going to see that in a large class of examples, one can construct a 
weight function V and find constants c < 1 and T > such that 

\\VtH ~ ^tHItv.v < c\\n - v\\iwy , (5.20) 

for any two probability measures /j, and v. This implies that the map Vt is a contraction on the 
space of probability measures, which must therefore have exactly one fixed point, yielding both 
the existence of an invariant measure and the exponential convergence of VI v to jjl^ for every 
initial probability measure v which integrates V. 

This argument is based on the following abstract result that works for arbitrary Markov semi- 
groups on Polish (that is separable, complete, metric) spaces: 
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Theorem 5.28 (Harris) Let Vt be a Markov semigroup on a Polish space X such that there exists 
a time Tq > and a function V: X — > R + such that: 

• The exist constants 7 < 1 and K > such that Vt V(x) < jV(x) + K for every x £ X. 

• For every K' > 0, there exists 5 > swc/j that \\Vj^ 5 X — S y \\iy < 2 — (5 /or every pair 
x, y such that V(x) + V(y) < K'. 

Then, there exists T > such that (5.20) holds for some c < 1. 

In a nutshell, the argument for the proof of Theorem 5.28 is the following. There are two 
mechanisms that allow to decrease the weighted total variation distance between two probability 
measures: 

2. The mass of the two measures moves into regions where the weight V(x) becomes smaller. 
1. The two measures 'spread out' in such a way that there is an increase in the overlap between 



The two conditions of Theorem 5.28 are tailored such as to combine these two effects in order to 
obtain an exponential convergence of V^p to the unique invariant measure for Vt as t — > 00. 

Remark 5.29 The condition that there exists 5 > such that \\Vj~ 5 x - V^yW-rs < 2 - (5 for 
any x, y € A is sometimes referred to in the literature as the set A being a small set. 

Remark 5.30 Traditional proofs of Theorem 5.28 as given for example in [MT93] tend to make 
use of coupling arguments and estimates of return times of the Markov process described by Vt to 
level sets of V. The basic idea is to make use of (5. 17) to get a bound on the total variation between 
V^p, and V^v by constructing an explicit coupling between two instances xt and y t of a Markov 
process with transition semigroup {Vt}- Because of the second assumption in Theorem 5.28, 
one can construct this coupling in such a way that every time the process (xt, yt) returns to some 
sufficiently large level set of V, there is a probability 5 that x t > = yt' for t' > t + Tq. The 
first assumption then guarantees that these return times have exponential tails and a renewal-type 
argument allows to conclude. 

Such proofs are quite involved at a technical level and are by consequent not so easy to follow, 
especially if one wishes to get a spectral gap bound like (5.20) and not "just" an exponential decay 
bound like 



with a constant C depending on x and y. Furthermore, they require more background in advanced 
probability theory than what is assumed for the scope of these notes. 

The elementary proof given here is taken from [HM08b] and is based on the arguments first 
exposed in [HM08a]. It has the disadvantage of being less intuitively appealing than proofs based 
on coupling arguments, but this is more than offset by the advantage of fitting into less than two 
pages without having to appeal to advanced mathematical concepts. It also has the advantage of 
being generalisable to situations where level sets of the Lyapunov function are not small sets, see 
[HMS09]. 

Before we turn to the proof of Theorem 5.28, we define for every (3 > the distance function 



them. 
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One can check that the positivity of V implies that this is indeed a distance function, albeit a rather 
strange one. We define the corresponding 'Lipschitz' seminorm on functions ip: X — ► R by 

M Lip, = SUp — . 

x^y dp(x, y) 
We are going to make use of the following lemma: 

Lemma 5.31 With the above notations, one has H^Hup^ = inf c gR ||<£> + c\\py 

Proof. It is obvious that ||v||up^ < llv 3 + C \\/3V f° r every c £ R. On the other hand, if x is any 
fixed point in X, one has 

\^{x)\ < \<p(x )\ + 1^11^(2 + 0V{x) + pVfro)) , (5-21) 

for all x G X. Set now 

c = - sup((^(x) - ||^|| L ip a (l +PV(x))) . 

x<=X 

It follows from (5.21) that c is finite. Furthermore, one has 

y%) + c < <p(y) - (ip(y) - iMi LiP/3 (i + my))) = y\\u P0 (i + my)) , 



and 



ip(y) + c= \nf{<p{y) - <p(x) + |M| L i Pfl (l + PV(x))) 

XdX 



> mf |M| L i P/3 (l + PV{x) - dp{x,y)) = -||(p||Li P/3 (l + PV(y)) , 



which implies that \\ip + c\\pv < ||v||up^- □ 

def 

Proof of Theorem 5.28. During this proof, we use the notation V = Vt for simplicity. We are 
going to show that there exists a choice of (3 G (0, 1) such that there is a < 1 satisfying the bound 

\V<p(x) - V<p(y)\ < adp(x, y)|M| L i P/3 , (5.22) 

uniformly over all measurable functions (p: X —> R and all pairs x, y G Af. Note that this is 
equivalent to the bound ||P^||Li P(3 < "IMlLip^- Combining this with Lemma 5.31 and (5.19), we 
obtain that, for T = uTq, one has the bound 

\\VtH-Vtv\\tv,v= inf / CPT<p)(.x)(n-i/)(.dx) 

\\<p\\v<lJx 
inf inf f ((%)(i) + c) (// - i/)(dr) 



< inf inf UPtV + c\\ v I (1 + F(x)) |/x - i/|(dx) 
llvll v<l ceR Jx 

= inf /3 _1 inf \[P T ip + c^yll/x - z/|| TVV 
IMIv<l ceR 

= Z?" 1 inf ll^r^llLip^l^ - i/Htv.v 

II Vll v<L 

/ «™ • * , ll ll / 
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Since a < 1, the result (5.20) then follows at once by choosing n sufficiently large. 

Let us turn now to (5.22). If x = y, there is nothing to prove, so we assume that x / y. 
Fix an arbitrary non-constant function ip and assume without loss of generality that H^Hup^ = 1- 
It follows from Lemma 5.31 that, by adding a constant to it if necessary, we can assume that 
|¥>(z) + c| < (1 + PV(x)). 

This immediately implies the bound 

\Pcp(x) - Vip(y)\ < (2 + (3VV(x) + pPV(y)) 

< 2 + + PyV(x) + /3jV(y) . 

Suppose now that x and y are such that V(x) + V(y) > 2 ^ 2 . A straightforward calculation 
shows that in this case, for every [3 > there exists a± < 1 such that (5.22) holds. One can choose 
for example 

_ 1 1 P 
ai 21- 7 + (3K + p- 

Take now a pair x, y such that V(x) + V(y) < 2 ^j^ 2 . Note that we can write tp = tpi + ip 2 
with \(p!(x)\ < 1 and \<p 2 (x)\ < 0V(x). (Set ^i(x) = (<p(x) A 1) V (-1).) It follows from the 
assumptions on V that there exists some 5 > such that 

\Vip(x) - Vip(y)\ < \P<pi(x) - P<pi(y)\ + \V<p 2 (x) - V<p 2 (y)\ 

< \\ V * dx - V*5 y \\ TW + (3{VV){x) + f3{VV){y) 

<2-5 + (3(2K + ~/V(x) + jV(y)) <2-5 + 2/3K^-^- . 

1-7 

If we now choose (5 < j^jj^, (5.22) holds with a = 1 — ^5 < 1. Combining this estimate with 
the one obtained previously shows that one can indeed find a and (3 such that (5.22) holds for all 
x and y in X, thus concluding the proof of Theorem 5.28. □ 

One could argue that this theorem does not guarantee the existence of an invariant measure 
since the fact that V^/j. = \i does not guarantee that V t fj, = \i for every t. However, one has: 

Lemma 5.32 If there exists a probability measure such that V^p, = p-for some fixed time T > 0, 
then there also exists a probability measure floo such that V^Hoc = Poo far all t > 0. 

Proof. Define the measure p,^ by 

Moo 04) = ^£v;n(A)dt. 
It is then a straightforward exercise to check that it does have the requested property. □ 

We are now able to use this theorem to obtain the following result on the convergence of the 
solutions to (5.1) to an invariant measure in the total variation topology: 

Theorem 5.33 Assume that (5.1) has a solution in some Hilbert space H and that there exists a 

l li 

time T such that \\S{T)\\ < 1 and such that S(T) maps Tt into the image of Qj. . Then (5.1) 
admits a unique invariant measure fioo and there exists 7 > such that 



for every probability measure v onTL with finite second moment. 
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Proof. Let V(x) = \\x\\ and denote by m the centred Gaussian measure with covariance Q t . We 
then have 

V t V{x)< ||5(t)x||+ / \\x\\n t (dx), 

JH 

which shows that the first assumption of Theorem 5.28 is satisfied. A simple variation of Exer- 
cise 3.34 (use the decomposition H = H® ker K) shows that the Cameron-Martin space of fix is 

1/2 

given by Im Qf, endowed with the norm 

||/i|| r =inf{||x|| : h = Q^ 2 x} . 

1/2 

Since we assumed that S(T) maps TC into the image of Qf, , it follows from the closed graph 
theorem that there exists a constant C such that HS^T^Ht < C\\x\\ for every x G H. It follows 
from the Cameron-Martin formula that the total variation distance between V^5 X and V^5 y is 
equal to the total variation distance between M(0, 1) and M(\\S(T)x - S(T)y\\ T , 1), so that the 
second assumption of Theorem 5.28 is also satisfied. 

Both the existence of p,^ and the exponential convergence of V\ v towards it is then a con- 
sequence of Banach's fixed point theorem in the dual to the space of measurable functions with 
||<£>||y < oo. □ 

Remark 5.34 The proof of Theorem 5.33 shows that if its assumptions are satisfied, then the map 
x i ^ VI S x is continuous in the total variation distance for t >T. 

1/2 

Remark 5.35 Since Im S(t) decreases with t and Im Q t increases with t, it follows that if 

1/2 

ImS'(t) C ImQ t ' for some t, then this also holds for any subsequent time. This is consis- 
tent with the fact that Markov operators are contraction operators in the supremum norm, so that 
if x i ^ V\ S x is continuous in the total variation distance for some t, the same must be true for all 
subsequent times. 

While Theorem 5.33 is very general, it is sometimes not straightforward at all to verify its 
assumptions for arbitrary linear SPDEs. In particular, it might be complicated a priori to determine 

1/2 

the image of Q t . The task of identifying this subspace can be made easier by the following result: 

1/2 

Proposition 5.36 The image of Q t is equal to the image of the map A t given by 

A t :L 2 ([0,t],JC) ->H, A t :h^ f S(s)Qh(s)ds . 

Jo 

Proof. Since Q t = A%A\, we can use polar decomposition [RS80, Thm VI. 10] to find an isometry 
Jt of (ker At) 1 - C H (which extends to H by setting J t x = for x £ ker A t ) such that Q X J 2 = 
A t J t . 

Alternatively, one can show that, in the situation of Theorem 3.44, the Cameron-Martin space 
of jl = A*p, is given by the image under A of the Cameron-Martin space of p,. This follows from 
Proposition 3.31 since, as a consequence of the definition of the push-forward of a measure, the 
composition with A yields an isometry between L 2 (B, fi) and L 2 (B, //). □ 

1/2 

One case where it is straightforward to check whether S(t) maps H into the image of Q t is 
the following: 
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Example 5.37 Consider the case where K, = TL, L is selfadjoint, and there exists a function 
/: R — > R_|_ such that Q = f(L). (This identity should be interpreted in the sense of the functional 
calculus already mentioned in Theorem 4.18.) 

If we assume furthermore that /(A) > for every A € R, then the existence of an invariant 
measure is equivalent to the existence of a constant c > such that (x, Lx) < — c\\x\\ 2 for every 
iGW. Using functional calculus, we see that the operator Qt is then given by 

and, for every T > 0, the Cameron-Martin norm for ht is equivalent to the norm 



f(L) 



In order to obtain convergence V\ v — > [i^ in the total variation topology, it is therefore sufficient 
that there exist constants c, C > such that /(A) > Ce~ cX for A > 0. 

This shows that one cannot expect convergence in the total variation topology to take place 
under similarly weak conditions as in Proposition 5.23. In particular, convergence in the total 
variation topology requires some non-degeneracy of the driving noise which was not the case for 
weak convergence. 

Exercise 5.38 Consider again the case K, = TL and L selfadjoint with (x, Lx) < — c\\x\\ 2 for some 
c > 0. Assume furthermore that Q is selfadjoint and that Q and L commute, so that there exists 
a space L 2 (.M,/i) isometric to TL and such that both Q and L can be realised as multiplication 
operators (say / and g respectively) on that space. Show that: 

• In order for there to exist solutions in TL, the set Aq = {A G M. : /(A) / 0} must be 
'essentially countable' in the sense that it can be written as the union of a countable set and 
a set of /i-measure 0. 

1/2 

• If there exists T > such that Im S(T) C Im Q T , then /i is purely atomic and there exists 
some possibly different time t > such that S(t) is trace class. 

1/2 

Exercise 5.37 suggests that there are many cases where, if S(t) maps H to \vaQ t for some 
t > 0, then it does so for all t > 0. It also shows that, in the case where L and Q are selfadjoint and 
commute, Q must have an orthnormal basis of eigenvectors with all eigenvalues non-zero. Both 
statements are certainly not true in general. We see from the following example that there can be 

1/2 

infinite-dimensional situations where S(t) maps H to Im Q t even though Q is of rank one! 

Example 5.39 Consider the space H = R©L 2 ([0, 1], R) and denote elements of H by (a, u) with 
a 6 R. Consider the semigroup S on H given by 



S{T)(a, u) = (a, u) , u(x) 



a for x < t 
u(x — t) for x > t. 



It is easy to check that S is indeed a strongly continuous semigroup on TL and we denote its 
generator by (0, d x ). We drive this equation by adding noise only on the first component of TL. In 
other words, we set K, = R and Ql = (1, 0) so that, formally, we are considering the equation 

da = dW(t) , du = d x u dt . 
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Even though, at a formal level, the equations for a and u look decoupled, they are actually coupled 
via the domain of the generator of S. In order to check whether S(t) maps Ti. into Tit = Im Qk^ 2 , 
we make use of Proposition 5.36. This shows that Tit consists of elements of the form 

/ h(s)xs ds , 
Jo 

where h € L 2 ([0, t]) and Xs is the image of (1, 0) under S(s), which is given by (1, 1[o,sai])- On 
the other hand, the image of S(t) consists of all elements (a, u) such that u(x) = a for x < t. 
Since one has Xs(x) = for x > s, it is obvious that Im S(t) <£_ Tit for t < 1. 

On the other hand, for t > 1, given any a > 0, we can find a function h G L 2 ([0, t]) such that 
h(x) = for x < 1 and J ' h(x) dx = a. Since, for s > 1, one has Xs{x) = 1 for every x G [0, 1], 
it follows that one does have Im S(t) C Tit for t < 1. 



6 Semilinear SPDEs 

Now that we have a good working knowledge of the behaviour of solutions to linear stochastic 
PDEs, we are prepared to turn to nonlinear SPDEs. In these notes, we will restrict ourselves to the 
study of semilinear SPDEs with additive noise. 

In this context, a semilinear SPDE is one such that the nonlinearity can be treated as a pertur- 
bation of the linear part of the equation. The word additive for the noise refers to the fact that, as 
in (5.1), we will only consider noises described by a fixed operator Q:JC ^ B, rather than by an 
operator-valued function of the solution. We will therefore consider equations of the type 

dx = Lxdt + F(x) dt + Q dW(t) , x(0) = x G B , (6.1) 

where L is the generator of a strongly continuous semigroup S on a separable Banach space B, 
W is a cylindrical Wiener process on some separable Hilbert space K,, and Q: K, — > B is bounded. 
Furthermore, F is a measurable function from some linear subspace V{F) C B into B. We will 
say that a process t x(t) G V{F) is a mild solution to (6.1) if the identity 

x(t) = S(t)x + /' S(t-s)F(x(s))ds+ f S(t - s)Q dW(s) . (6.2) 
Jo Jo 

holds almost surely for every t > 0. 
6.1 Local solutions 

Throughout this section, we will make the standing assumption that the linearisation to (6.1) (that 
is the corresponding equation with F = 0) does have a continuous solution with values in B. In 
order to simplify notations, we are going to write 

W L (t)= f S(t-s)QdW(s), 
Jo 

In the nonlinear case, there are situations where solutions explode after a finite (but possibly 
random) time interval. In order to be able to account for such a situation, we introduce the notion 
of a local solution. Recall first that, given a cylindrical Wiener process W defined on some proba- 
bility space (Q, P), we can associate to it the natural filtration {F t }t>o generated by the increments 
of W. In other words, for every t > 0, F t is the smallest cr-algebra such that the random variables 
W(s) — W(r) for s, r < t are all F t -measurable. 

In this context, a stopping time is a positive random variable r such that the event {r < T} is 
F^-measurable for every T > 0. With this definition at hand, we have: 
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Definition 6.1 A local mild solution to (6.1) is a D(F)-valued stochastic process x together with 
a stopping time r such that r > almost surely and such that the identity 



holds almost surely for every stopping time t such that t < r almost surely. 

Remark 6.2 In some situations, it might be of advantage to allow F to be a map from V(F) to B' 
for some superspace B' such that B C B' densely and such that S(t) extends to a continuous linear 
map from B' to B. The prime example of such a space B' is an interpolation space with negative 
index in the case where the semigroup S is analytic. The definition of a mild solution carries over 
to this situation without any change. 

A local mild solution (x, r) is called maximal if, for every mild solution (x, f ), one has f < r 
almost surely. 

Exercise 6.3 Show that mild solutions to (6.1) coincide with mild solutions to (6.1) with L re- 
placed by L = L — c and F replaced by F = F + c for any constant c G R. 

Our first result on the existence and uniqueness of mild solutions to nonlinear SPDEs makes 
the strong assumption that the nonlinearity F is defined on the whole space B and that it is locally 
Lipschitz there: 

Theorem 6.4 Consider (6.1) on a Banach space B and assume that Wl is a continuous B-valued 
process. Assume furthermore that F:B — > B is such that it restriction to every bounded set is Lip- 
schitz continuous. Then, there exists a unique maximal mild solution (x, r) to (6.1 ). Furthermore, 
this solution has continuous sample paths and one has lim t ^ T ||ar(£)|| = oo almost surely on the set 



If F is globally Lipschitz continuous, then r = oo almost surely. 

Proof. Given any realisation Wl € C(R+, B) of the stochastic convolution, we are going to show 
that there exists a time r > depending only on Wl up to time r and a unique continuous function 
x: [0, r) — > B such that (6.3) holds for every t < r. Furthermore, the construction will be such 
that either r = oo, or one has lim^ T ||x(t)|| = oo, thus showing that (x, r) is maximal. 

The proof relies on the Banach fixed point theorem. Given a terminal time T > and a 
continuous function g: R + -> B, we define the map M SjT : C([0, T],B) — ► C([0, T],B) by 



The proof then works in almost exactly the same way as the usual proof of uniqueness of a maximal 
solution for ordinary differential equations with Lipschitz coefficients. Note that we can assume 
without loss of generality that the semigroup S is bounded, since we can always subtract a constant 
to L and add it back to F. Using the fact that H-S^)!! < M for some constant M, one has for any 
T > the bound 




(6.3) 



{r < oo}. 




sup 

te[o,T] 



M g , T vtt) - Mg tT v(t)\\ < MT sup \\F(u(t)) - F(v(t))\\ . 

te[o,T] 



(6.4) 



Furthermore, one has 



sup \\M g>T u(t) - g(t)\\ < MT sup \\F(u(t))\\ . 



(6.5) 
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Fix now an arbitrary constant R > 0. Since F is locally Lipschitz, it follows from (6.4) and 
(6.5) that there exists a maximal T > such that M 9i t maps the ball of radius R around g in 
C([0,T],B) into itself and is a contraction with contraction constant \ there. This shows that 
M 9j t has a unique fixed point for T small enough and the choice of T was obviously performed 
by using knowledge of g only up to time T. Setting g(t) = S(t)xo + the pair (x, T), where 

T is as just constructed and x is the unique fixed point of M g ^ thus yields a local mild solution to 
(6.1). ' 

In order to construct the maximal solution, we iterate this construction in the same way as 
in the finite-dimensional case. Uniqueness and continuity in time also follows as in the finite- 
dimensional case. In the case where F is globally Lipschitz continuous, denote its Lipschitz 
constant by K. We then see from (6.4) that M 9: t is a contraction on the whole space for T < 
1/(KM), so that the choice of T can be made independently of the initial condition, thus showing 
that the solution exists for all times. □ 

While this setting is very straightforward and did not make use of any PDE theory, it never- 
theless allows to construct solutions for an important class of examples, since every composition 
operator of the form (iV(u))(0 = (/ o is locally Lipschitz on C(K, R d ) (for K a compact 
subset of R ra , say), provided that /: R d — ► R d is locally Lipschitz continuous. 

A much larger class of examples can be found if we restrict the regularity properties of F, but 
assume that L generates an analytic semigroup: 

Theorem 6.5 Let L generate an analytic semigroup onB ( denote by B a , a £ lithe corresponding 
interpolation spaces) and assume that Q is such that the stochastic convolution Wl has almost 
surely continuous sample paths in B a for some a > 0. Assume furthermore that there exists 7 > 
and 5 € [0, 1) such that, for every (5 G [0, 7], the map F extends to a locally Lipschitz continuous 
map from Bp to Bp_s that grows at most polynomially. 

Then, (6.1) has a unique maximal mild solution (x,r) with x taking values in Bp for every 
13 < & = r a A (7 + 1 - 6). 

Proof. In order to show that (6.1) has a unique mild solution, we proceed in a way similar to the 
proof of Theorem 6.4 and we make use of Exercise 4.38 to bound \\S(t — s)F(u(s))\\ in terms of 
\\F{u{s))\\- S . This yields instead of (6.4) the bound 

sup \\M 9)T u(t) - M giT v(t)\\ < MT 1 - 6 sup \\F(u(t)) - F(v(t))\\ , (6.6) 
te[u,T] te[o,T] 

and similarly for (6.5), thus showing that (6.1) has a unique £>-valued maximal mild solution (x, r). 
In order to show that x(t) actually belongs to Bp for t < r and f3 < a A 7, we make use of a 
'bootstrapping' argument, which is essentially an induction on [3. 

For notational convenience, we introduce the family of processes W£(t) = J* t S(t — 
r)Q dW(r), where a G [0, 1) is a parameter. Note that one has the identity 

W£(t) = W L (t) - S((l - a)t)W L (at) 

so that if Wl is continuous with values in B a , then the same is true for W£. 

We are actually going to show the following stronger statement. Fix an arbitrary time T > 0. 
Then, for every (3 6 [0, /3*) there exist exponents pp > 1, qp > 0, and constants a G (0, 1), C > 
such that the bound 

\\x t \\p<Cr^(l+ sup ||x s ||+ sup \\W£{s)\\p) P \ (6.7) 

v sE[at,t] 0<s<t ' 
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holds almost surely for all t G (0, T]. 

The bound (6.7) is obviously true for (3 = with pp = 1 and qp = 0. Assume now that, for 
some (3 = /3q G [0, 7], the bound (6.7) holds. We will then argue that, for any e G (0, 1 — (5), the 
statement (6.7) also holds for (3 = Pq + e (and therefore also for all intermediate values), provided 
that we adjust the constants appearing in the expression. Since it is possible to go from (3 = to 
any value of /3 < 7 + 1 — 5 in a finite number of such steps, the claim then follows at once. 

From the definition of a mild solution, we have the identity 

x t = S((l - a)t)x at + f S(t - s)F{x{s)) ds + W£(t) . 

J at 

Since (3 < 7, it follows from our polynomial growth assumption on F that there exists n > such 
that, for t G (0,T], 

WxtWp+e < ct- £ \\x at \y + \\w£(t)\y +£ + c [\t - S )-^ +s \i + \\x s \yrds 

Jot 

< C{r e + t l~e^ ) sup (l + ||x a ||^) + ||Wi (*)||^ +e 
at<s<t 

<ct- £ sup (i + \\x s \\ n p ) + \\w£(t)\y +£ . 

at<s<t 

Here, the constant C depends on everything but t and xq. Using the induction hypothesis, this 
yields the bound 

Hxtll^ < Ct- £ -"^(l + sup + sup \\W£{8)\\ p ) n ^ + ||W7, a (t)||^ +e , 

se[a!H,t] 0<s<t 

thus showing that (6.7) does indeed hold for (3 = (3q + e, provided that we replace a by a 2 and set 
P/3+e = npp and qp +£ = e + nqp. This concludes the proof of Theorem 6.5. □ 

6.2 Interpolation inequalities and Sobolev embeddings 

The kind of bootstrapping arguments used in the proof of Theorem 6.5 above are extremely useful 
to obtain regularity properties of the solutions to semilinear parabolic stochastic PDEs. However, 
they rely on obtaining bounds on the regularity of F from one interpolation space into another. 
In many important situations, the interpolation spaces turn out to be given by fractional Sobolev 
spaces. For the purpose of these notes, we are going to restrict ourselves to the analytically easier 
situation where the space variable of the stochastic PDE under consideration takes values in the 
d-dimensional torus T d . In other words, we restrict ourselves to situations where the operator 
describing the linearised evolution is endowed with periodic boundary conditions. 

This will make the proof of the embedding theorems presented in these notes technically more 
straightforward. For the corresponding embeddings with more general boundary conditions or 
even on more general manifolds or unbounded domains, we refer for example the comprehensive 
series of monographs [Tri83, Tri92, Tri06]. 

Recall that, given a distribution u G L 2 (T d ), we can decompose it as a Fourier series: 

u(x) = u ^ l{k ' x) , 
kez d 

where the identity holds for (Lebesgue) almost every x G T d . Furthermore, the I? norm of u is 
given by Parseval's identity |M| 2 = I]l li fc| 2 - We have 
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Definition 6.6 The fractional Sobolev space H s (T d ) for s > is given by the subspace of func- 
tions u e L 2 (T d ) such that 



u 



,2 def 

\h s — 



]T (1 + |A;| 2 )> fc | 2 < oc . (6.8) 



kez- 



d 



Note that this is a separable Hilbert space and that H° = L 2 . For s < 0, we define H s as the 
closure of L 2 under the norm (6.8). 

Remark 6.7 By the spectral decomposition theorem, H s for s > is the domain of (1 — A) 5 / 2 
and we have ||«||h s = ||(1 — A) s / 2 u||^2. 

A very important situation is the case where L is a differential operator with constant coef- 
ficients (formally L = P(d x ) for some polynomial P: R d — > R) and H is either an L 2 space or 
some Sobolev space. In this case, one has 

Lemma 6.8 Assume that P: R rf — > R is a polynomial of degree 2m such that there exist positive 
constants c, C such that the bound 

(-l) m+1 c\k\ 2m < P(k) < (-l) m+1 C\k\ 2m , 

holds for all k outside of some compact set. Then, the operator P(d x ) generates an analytic 
semigroup onTL = H s for every s G R and the corresponding interpolation spaces are given by 
H a = H s+2ma . 

Proof. By inspection, noting that P{d x ) is conjugate to the multiplication operator by P(ik) via 
the Fourier decomposition. □ 

Note first that for any two positive real numbers a and b and any pair of positive conjugate 
exponents p and q, one has Young's inequality 

a p b q 11 
ab<— + — , - + - = l. (6.9) 
p q p q 

As a corollary of this elementary bound, we obtain Holder's inequality, which can be viewed as a 
generalisation of the Cauchy-Schwartz inequality: 

Proposition 6.9 (Holder's inequality) Let (M. , p) be a measure space and let p and q be a pair 
of positive conjugate exponents. Then, for any pair of measurable functions u,v:Ai R. one 
has 

/ |n(x)u(x)| p(dx) < \\u\\ p \\v\\ q , 
J M 

for any pair (p, q) of conjugate exponents. 

Proof. It follows from (6.9) that, for every e > 0, one has the bound 



e p \\u\\P, 



/ \u(x)v (x) fi(dx) < 

JM P 



P - + 



qei 
- i-i 

Setting e = \\v\\q\\u\\p concludes the proof. □ 
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One interesting consequence of Holder's inequality is the following interpolation inequality 
for powers of selfadjoint operators: 

Proposition 6.10 Let Abe a positive definite selfadjoint operator on a separable Hilbert space 
H and let a G [0, 1]. Then, the bound \\A a u\\ < ||Au|| Q |H| 1_a holds for every u G V(A a ) C H. 

Proof. The extreme cases a G {0, 1} are obvious, so we assume a G (0, 1). By the spectral 
theorem, we can assume that H = L 2 (M,fi) and that A is the multiplication operator by some 
positive function /. Applying Holder's inequality withp = 1/a and q = 1/(1 — a), one then has 

\\A a u\\ 2 = J f 2a (x)u 2 (x) n(dx) = J \fu\ 2a (x)\u\ 2 - 2a (x)n(dx) 

< (/ f\x)u\x) ^dx)) a [j u\x) ti{dx)) l ~ a , 

which is exactly the bound we wanted to show. □ 

An immediate corollary is: 

Corollary 6.11 For any t > s and any r G [s, t], the bound 

Nik? < IMI^IMIh" (6-10) 

is valid for every u G H t . 

Proof. Apply Proposition 6.10 with H = H s , A = (1 - A)^, and a = (r — s)/(t - s). □ 



able functions and any exponents Pi > 1 such that Ya=i Pi 1 = 1> one has the bound 



Exercise 6.12 As a consequence of Holder's inequality, show that for any collection of n measur- 

lents pi > 1 such that Ya=i P^ 1 

Ul(x)- --Unix^ fl{dx) < \\ui\\ pi ■ ■ ■ \\u n \\ pn . 



L 



M 

Following our earlier discussion regarding fractional Sobolev spaces, it would be convenient 
to be able to bound the LP norm of a function in terms of one of the fractional Sobolev norms. It 
turns out that bounding the L°° norm is rather straightforward: 

Lemma 6.13 For every s > |, the space H s (T d ) is contained in the space of continuous functions 
and there exists a constant C such that < C||-u||h s - 



Proof. It follows from Cauchy-Schwarz that 

kez d fcez a kez d 



E < (Ed + l*f) s Kl 2 ) 1/2 (E d + w 2 r s y 



Since the sum in the second factor converges if and only if s > |, the claim follows. □ 

Exercise 6.14 In dimension d = 2, find an example of an unbounded function u such that 

IMIi/ 1 < oo. 



Exercise 6.15 Show that for s > % , H s is contained in the space C a (T d ) for every a<s- 



2- 
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As a consequence of Lemma 6.13, we are able to obtain a more general Sobolev embedding 
for all L p spaces: 

Theorem 6.16 (Sobolev embeddings) Let p e [2, oo]. Then, for every s > | - |, the space 
H s (T d ) is contained in the space L p (T d ) and there exists a constant C such that \\u\\u> < 
C\\u\\h s - 

Proof. The case p = 2 is obvious and the case p = oo has already been shown, so it remains to 
show the claim for p G (2, oo). The idea is to divide Fourier space into 'blocks' corresponding to 
different length scales and to estimate separately the LP norm of every block. More precisely, we 
define a sequence of functions u (ri) by 

u-\x) = u , u in \x)= ]T u k e 1 ^, 

2 n <\k\<2 n + 1 

so that one has u = J2n>-i F° r n > 0, one has 

\\u (n) \\ p LP < ||« (n) ||i 2 ||u (n) ||^. (6-11) 

Choose now s' = ^ + e and note that the construction of u (n \ together with Lemma 6.13, guaran- 
tees that one has the bounds 

\\u {n) \\ L 2 < 2~ ns \\u in) \\ H ° , ||n (n) || L - < C\\u {n) \\ Hs , < C2 n(s '- s) \\u {n) \\ H s ■ 
Inserting this into (6.11), we obtain 

II (n)\\ / n\\ (n)|| o«(( s '- s )— - — ) n\\ (n)\\ o"(--|-«) ^ r<\\ II o™(--|~ s ) 

1 1 ix < C\\u y j \\h s 2 p p = (^\\u \\h s 2 y p 2 ' < C\\u\\hs2 y p 2 ' . 

It follows that || it || lp < I «o I + 2~2 n >o \\ u ^\\lp < C|MI-ff s > provided that the exponent appearing 
in this expression is negative, which is precisely the case whenever s > | — ^. □ 

Remark 6.17 Forp / oo, one actually has H s (T d ) C D>{T d ) with s = f - j, but this borderline 
case is more difficult to obtain. 

Combining the Sobolev embedding theorem and Holder's inequality, it is eventually possible 
to estimate in a similar way the fractional Sobolev norm of a product of two functions: 

Theorem 6.18 Let r, s and t be positive exponents such that s A r > t and s + r > t + ^. Then, 
ifu£ H r and v £ H s , the product w = uv belongs to LL 1 . 

Proof. Define u (n) and v (m) as in the proof of the Sobolev embedding theorem and set w^ m,n) = 
u (m V n) . Note that one has w ( ™' n) = if \k\ > 2 3+( - myn \ It then follows from Holder's inequality 
that if p, q > 2 are such that p^ 1 + q^ 1 = \, one has the bound 



'J (m ' n) \\m < C2 t(mVn) \\w (m > n) \\ L 2 < C2' (mVn) ||u (m) || LP ||t> (n) | 



Assume now that m > n. The conditions on r, s and t are such that there exists a pair (p, q) as 
above with 

d d d d 

r>t+---, s> --- . 
2 p 2 q 
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In particular, we can find some e > such that 

\\u (m) \\ L v < C\\u {m) \\ H r- t -e < C2- m{t+£) \\u\\ H r , \\v (n) \\ LP < C 1 1 v {n) 1 1 H s -e < C2- m£ \\v\\ H s ■ 
Inserting this into the previous expression, we find that 

||„,,(m,n)|| ^ y^ro — rne: — Tie 11^,11 ll„,ll 

Since our assumptions are symmetric in u and v, we obtain a similar bound for the case m < n, 
so that 

\\w\\h* < \\ w(m,n) \\Ht < C\\u\\ H r\\u\\H' 1~ m£ ~ ne < C\\u\\ht\\u\\h* , 

m,n>0 m,n>0 

as requested. □ 

Exercise 6.19 Show that the conclusion of Theorem 6.18 still holds if s = t = r is a positive 
integer, provided that s > i. 



Exercise 6.20 Similarly to Exercise 6.12, show that one can iterate this bound so that if Sj > s > 

-i 

2 

\U1 ■ --UnWs < C\\ 

u i lUi 



are exponents such that J2i s i > s + ( " , then one has the bound 



Hint: The case s > | is simple, so it suffices to consider the case s < | . 

The functional inequalities from the previous section allow to check that the assumptions of 
Theorems 6.4 and 6.5 are verified by a number of interesting equations. 

6.3 Reaction-diffusion equations 

This is a class of partial differential equations that model the evolution of reactants in a gel, de- 
scribed by a spatial domain D. They are of the type 

du = Audt + f oudt + QdW(t) , (6.12) 

where u(x, t) G R d , x 6 D, describes the density of the various components of the reaction at 
time t and location x. The nonlinearity /: K d —> K d describes the reaction itself and the term Ait 
describes the diffusion of the reactants in the gel. The noise term Q dW should be interpreted as 
a crude attempt to describe the fluctuations in the quantities of reactant due both to the discrete 
nature of the underlying particle system and the interaction with the environment 3 . 

Equations of the type (6.12) also appear in the theory of amplitude equations, where they ap- 
pear as a kind of 'normal form' near a change of linear instability. In this particular case, one 
often has d = 2 and f(u) = ku — u\u\ 2 for some k G R, see [BHP05]. A natural choice for the 
Banach space B in which to consider solutions to (6.12) is the space of bounded continuous func- 
tions B = C(D,R d ) since the composition operator u ^ f o u (also sometimes called Nemitskii 
operator) then maps B into itself and inherits the regularity properties of /. If the domain D is 
sufficiently regular then the semigroup generated by the Laplacian A is the Markov semigroup for 
a Brownian motion in D. The precise description of the domain of A is related to the behaviour 
of the corresponding Brownian motion when it hits the boundary of D. In order to avoid techni- 
calities, let us assume from now on that D consists of the torus T n , so that there is no boundary to 
consider. 



3 A more realistic description of these fluctuations would result in a covariance Q that depends on the solution u. 
Since we have not developed the tools necessary to treat this type of equations, we restrict ourselves to the simple case 
of a constant covariance operator Q. 
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Exercise 6.21 Show that in this case, A generates an analytic semigroup on B = C(T",R d ) and 
that for a £ N, the interpolation space B a is given by B a = C 2a (T n , R d ). 

If Q is such that the stochastic convolution has continuous sample paths in B almost surely 
and / is locally Lipschitz continuous, we can directly apply Theorem 6.4 to obtain the existence 
of a unique local solution to (6.12) in C(T n ,R d ). We would like to obtain conditions on / that 
ensure that this local solution is also a global solution, that is the blow-up time r is equal to infinity 
almost surely. 

If / happens to be a globally Lipschitz continuous function, then the existence and uniqueness 
of global solutions follows from Theorem 6.4. Obtaining global solutions when / is not globally 
Lipschitz continuous is slightly more tricky. The idea is to obtain some a priori estimate on some 
functional of the solution which dominates the supremum norm and ensures that it cannot blow up 
in finite time. 

Let us first consider the deterministic part of the equation alone. The structure we are going 
to exploit is the fact that the Laplacian generates a Markov semigroup. We have the following 
general result: 

Lemma 6.22 Let Vt be a Feller 1 Markov semigroup over a Polish space X. Extend it to Cb(X, R d ) 
by applying it to each component independently. Let V: R d — > R+ be convex (that is V(ax + (1 — 
a)y) < aV(x) + (1 - a)V(y)for all x, y G R d and a e [0, 1]) and define V: C b (X, R d ) -► R+ 
by V(u) = sup xeX V(u(x)). Then V(V t u) < V(u)for every t > and every u £ C b (X, R d ). 

Proof. Note first that if V is convex, then it is continuous and, for every probability measure p, on 
R d , one has the inequality 

V( [ xp(dx))<[ V(x)fi(dx). (6.13) 

VR d ' jR d 

One can indeed check by induction that (6.13) holds if fi is a 'simple' measure consisting of a 
convex combination of finitely many Dirac measures. The general case then follows from the 
continuity of V and the fact that every probability measure on R d can be approximated (in the 
topology of weak convergence) by a sequence of simple measures. 

Denote now by Vt(x, • ) the transition probabilities for Vt, so that Vtu is given by the formula 
(Vtu){x) = f x u(y)Vt(x, dy). One then has 

V(V t u) = sup V ( [ u(y) V t (x, dy)) = supV([ v {u*V t (x, ■ ))(dv)) 
xex K Jx ' xex VR d 1 

< sup / V(v)(u*V t (x,-))(dv)= sup [ V(u(y))V t (x,dy) 
xex JR d xexJx 

< sup V(u(y)) = V(u) , 
y ex 

as required. □ 

In particular, this result can be applied to the semigroup S(t) generated by the Laplacian in 
(6.12), so that V(S(t)u) < V(u) for every convex V and every u G C(T n ,R d ). This is the main 
ingredient allowing us to obtain a priori estimates on the solution to (6.12): 



4 A Markov semigroup is Feller if it maps continuous functions into continuous functions. 
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Proposition 6.23 Consider the setting for equation (6.12) described above. Assume that Q is 
such that Wa has continuous sample paths in B = C(T n , R d ) and that there exists a convex twice 
differ entiable function V: R rf — > R + such that limi^i^^ V(x) = oo and such that, for every 
R > 0, there exists a constant C such that (VV(x), f(x + y)) < CV{x) for every x € R d and 
every y with \y\ < R. Then (6.12) has a global solution in B. 

Proof. We denote by u(t) the local mild solution to (6.12). Our aim is to obtain a priori bounds on 
V(u(t)) that are sufficiently good to show that one cannot have lim^ r \\u(t)\\ = oo for any finite 
(stopping) time r. 

Setting v(t) = u(t) — W&{t), the definition of a mild solution shows that v satisfies the equation 

v (t) = e At v(0) + f e A(t - s \f o ( V (s) + W A (s))) ds = e At v(0) + f e A(t - s) F(s) ds . 
Jo Jo 

Since t ^ v(t) is continuous by Theorem 6.4 and the same holds for Wa by assumption, the 
function t ^ F(t) is continuous in B. Therefore, one has 

lim U [ k e A(h ~ s) F(s)ds - he Ah F(0)) = . 
h^O h \Jo ' 

We therefore obtain for V(v) the bound 

Urn sup h^iVivit + h)) - V(v(t))) = lim sup h,- l (V(v(t) + hF(t)) - V(v(t))) . 

h^O h^O 

Since V belongs to C 2 by assumption, we have 

V(v(t) + hF(t)) = sup (V(v(x, t)) + h(VV(v(x, t)), F(x, t))) + 0(h 2 ) . 
xeT n 

Using the definition of F and the assumptions on V, it follows that for every R > there exists a 
constant C such that, provided that < R, one has 

lim sup h^iVivit + h)) - V(v(t))) < CV(v(t)) . 

h^O 

A standard comparison argument for ODEs then shows that V(v(t)) cannot blow up as long as 
||Wa(£)|| does not blow up, thus concluding the proof. □ 

Exercise 6.24 In the case d = 1, show that the assumptions of Proposition 6.23 are satisfied for 
V(u) = u 2 if / is any polynomial of odd degree with negative leading coefficient. 

Exercise 6.25 Show that in the case d = 3, (6.12) has a unique global solution when we take for 
/ the right-hand side of the Lorentz attractor: 

/ a(u 2 - ui) \ 
f(u) = ui(q -u 3 )-u 2 , 
V u\u 2 - l3u 3 J 

where q, a and (3 are three arbitrary positive constants. 
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6.4 The stochastic Navier-Stokes equations 

The Navier-Stokes equations govern the motion of an idealised incompressible fluid and are one 
of the most studied models in the theory of partial differential equations, as well as in theoretical 
and mathematical physics. We are going to use the symbol u(x, t) to denote the instantaneous 
velocity of the fluid at position x € R d and time t, so that u(x, t) G R d . With these notations, the 
deterministic Navier-Stokes equations are given by 

d t u = vAu - (u ■ V)u -Vp, div u = , (6.14) 

where the (scalar) pressure p is determined implicitly by the incompressibility condition div u = 
and v > denotes the kinematic viscosity of the fluid. In principle, these equations make sense 
for any value of the dimension d. However, even the deterministic equations (6. 14) are known to 
have global smooth solutions for arbitrary smooth initial data only in dimension d = 2. We are 
therefore going to restrict ourselves to the two-dimensional case in the sequel. As we saw already 
in the introduction, solutions to (6.14) tend to as times goes to oo, so that an external forcing is 
required in order to obtain an interesting stationary regime. 

One natural way of adding an external forcing is given by a stochastic force that is white in 
time and admits a translation invariant correlation function in space. In this way, it is possible 
to maintain the translation invariance of the equations (in a statistical sense), even though the 
forcing is not constant in space. We are furthermore going to restrict ourselves to solutions that 
are periodic in space in order to avoid the difficulties arising from partial differential equations in 
unbounded domains. The incompressible stochastic Navier-Stokes equations on the torus R 2 are 
given by 

du = vAu dt-(u- V)u dt-Vpdt + Q dW(t) , div u = , (6.15) 

where p and v > are as above. In order to put these equations into the more familiar form 
(6.1), we denote by II the orthogonal projection onto the space of divergence-free vector fields. In 
Fourier components, II is given by 

^u) k = u k - k -^0L. (6.16) 

(Note here that the Fourier coefficients of a vector field are themselves vectors.) With this notation, 
one has 

du = vAu dt + n(u • V)it dt + Q dW(t) = Au dt + F(u) dt + Q dW(t) . 

It is clear from (6. 16) that II is a contraction in any fractional Sobolev space. For t > 0, it therefore 
follows that 

\\F(u)\\ H t < ||u||h.||Vu||h'- < C\\uf H s , (6.17) 

provided that s > t V (| + \ + |) = t V (| + 1). In particular, this bound holds for s = t + 1, 
provided that t > 0. 

Furthermore, in this setting, since L is just the Laplacian, if we choose H = H s , then the 
interpolation spaces 7i a are given by H a = H s+2a . This allows us to apply Theorem 6.5 to show 
that the stochastic Navier-Stokes equations admit local solutions for any initial condition in H s , 
provided that s > 1, and that the stochastic convolution takes values in that space. Furthermore, 
these solutions will immediately lie in any higher order Sobolev space, all the way up to the space 
in which the stochastic convolution lies. 

This line of reasoning does however not yield any a priori bounds on the solution, so that it 
may blow up in finite time. The Navier-Stokes nonlinearity satisfies (u, F(u)) = (the scalar 



74 



Semilinear SPDEs 



product is the L 2 scalar product), so one may expect bounds in L 2 , but we do not know at this 
stage whether initial conditions in L 2 do indeed lead to local solutions. We would therefore like 
to obtain bounds on F(u) in negative Sobolev spaces. In order to do this, we exploit the fact that 
H~ s can naturally be identified with the dual of H s , so that 

\\F(u)\\ H -s = supjy F(u)(x)v(x)dx , v £ C°° , \\v\\ H ' < l} ■ 

Making use of the fact that we are working with divergence-free vector fields, one has (using 
Einstein's convention of summation over repeated indices): 

J F(u)v dx = — J VjUidiUj dx < \\v\\u> || Vn||^2 \\u\\l<i , 

provided that p, q > 2 and | + | = \ . We now make use of the fact that \u\li < C q 1 1 Vw 1 1 2 for 
every q € [2, 00) (but q = 00 is excluded) to conclude that for every s > there exists a constant 
C such that 

||F(u)||_ a < C||Vu||| 2 . (6.18) 

In order to get a priori bounds for the solution to the 2D stochastic Navier-Stokes equations, 
one can then make use of the following trick: introduce the vorticity w = V A u = d\u 2 — d^u,\. 
Then, provided that J u dx = (which, provided that the range of Q consists of vector fields with 
mean 0, is a condition that is preserved under the solutions to (6.15)), the vorticity is sufficient to 
describe u completely by making use of the incompressibility assumption div u = 0. Actually, the 
map w i-> u can be described explicitly by 

u k = {Kw) k = -^j^ , (fei, k 2 ) ± = (-fa, h) . 

This shows in particular that K is actually a bounded operator from H s into H s+1 for every s. It 
follows that one can rewrite (6.15) as 

dw = vAw dt + (Kw ■ V)io dt + Q dW(t) = Aw dt + F(w) dt + Q dW(t) . (6.19) 

Since F(w) = V A F(Kw), it follows from (6.18) that one has the bounds 

HFMll-i-, < C\\w\\ 2 L2 , 

so that F is a locally Lipschitz continuous map from 1? into H s for every s < — 1. This shows 
that (6.19) has unique local solutions for every initial condition in L 2 and that these solutions 
immediately become as regular as the corresponding stochastic convolution. 
Denote now by Wl the stochastic convolution 

W L (t)= C e A(t - s) QdW(s) , 
Jo 

and define the process v(t) = w(t) — Wl(£). With this notation, v is the unique solution to the 
random PDE 

d t v = uAv + F(v + W L ) . 

It follows from (6.17) that \\F(w)\\ H -s < C\\w\\ 2 Ha , provided that s > 1/3. For the sake of 
simplicity, assume from now on that Wl takes values in H 1 / 2 almost surely. Using the fact that 
(v, F(v)) = 0, we then obtain for the L 2 -norm of v the following a priori bound: 

d t \\ v \\ 2 = -2v\\ Vv || 2 - 2(W L , F(v + W L )) 
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< -2v\\Vv\\ 2 + 2\\W L \\ Hl/2 \\v + W L \\ 2 Hl/2 

< -2v\\Vv\\ 2 +4\\W L \\ Hl/2 (\\vf Hl/2 + \\W L f Hl/2 ) 

< -2v\\Vv\\ 2 +4||WL|| ifl/2 (||u||||Vu|| + ||Wi|&i /2 ) 

< ^l|Wi||^ 1/2 |k|| 2 + 2||Ty L ||^ 1/2 , (6.20) 

so that global existence of solutions then follows from Gronwall's inequality. 

This calculation is only formal, since it is not known in general whether the L 2 -norm of v is 
differentiable as a function of time. The bound that one obtains from (6.20) can however be made 
rigorous in a very similar way as for the example of the stochastic reaction-diffusion equation, so 
that we will not reproduce this argument here. 
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